Influenza hemagglutinin and neuraminidase variants

ABSTRACT

Polypeptides, polynucleotides, methods, compositions, and vaccines comprising (avian pandemic) influenza hemagglutinin and neuraminidase variants are provided.

CROSS REFERENCE TO RELATED APPLICATIONS

This application claims the benefit under 35 U.S.C. §119 (e) of U.S.Provisional Application Nos. 60/821,832 filed Aug. 9, 2006 and60/942,804, filed Jun. 8, 2007, the disclosures of each of which areincorporated herein in their entirety for all purposes.

SEQUENCE LISTING

The instant application contains a Sequence Listing which has beensubmitted in ASCII format via EFS-Web and is hereby incorporated byreference in its entirety. Said ASCII copy, created on Apr. 26, 2011, isnamed MDI-0438-UT.txt and is 151,676 bytes in size.

BACKGROUND OF THE INVENTION

Vaccines against various and evolving strains of influenza are importantfrom a community health stand point, as well as commercially, since eachyear numerous individuals are infected with different strains and typesof influenza virus. Infants, the elderly, those without adequate healthcare and immuno-compromised persons are at special risk of death fromsuch infections. Compounding the problem of influenza infections is thatnovel influenza strains evolve readily and can spread amongst variousspecies, thereby necessitating the continuous production of newvaccines.

Numerous vaccines capable of producing a protective immune responsespecific for such different and influenza viruses/virus strains havebeen produced for over 50 years and include whole virus vaccines, splitvirus vaccines, surface antigen vaccines and live attenuated virusvaccines. However, while appropriate formulations of any of thesevaccine types are capable of producing a systemic immune response, liveattenuated virus vaccines have the advantage of also being able tostimulate local mucosal immunity in the respiratory tract. Considerablework in the production of influenza viruses, and fragments thereof, forproduction of vaccines has been done by the present inventors andco-workers; see, e.g., U.S. Application Nos. 60/420,708, filed Oct. 23,2002; 60/574,117, filed May 24, 2004; 10/423,828, filed Apr. 25, 2003;60/578,962, filed Jun. 12, 2004; and 10/870,690 filed Jun. 16, 2004, thedisclosure of which is incorporated by reference herein.

Because of the continual emergence (or re-emergence) of differentinfluenza strains, new influenza vaccines are continually desired. Suchvaccines typically are created using antigenic moieties of the newlyemergent virus strains, thus, polypeptides and polynucleotides of novel,newly emergent, or newly re-emergent virus strains (especially sequencesof antigenic genes) are highly desirable.

The present invention provides new and/or newly isolated influenzahemagglutinin and neuraminidase variants that are capable of use inproduction of numerous types of vaccines as well as in research,diagnostics, etc. Numerous other benefits will become apparent uponreview of the following.

SUMMARY OF THE INVENTION

In some aspects herein, the invention comprises an isolated orrecombinant polypeptide that is selected from: any one of thepolypeptides encoded by SEQ ID NO:1 through SEQ ID NO:10 or SEQ ID NO:21through SEQ ID NO:26 or SEQ ID NO:33 through SEQ ID NO:38; any one ofthe polypeptides of SEQ ID NO:11 through SEQ ID NO:20 or SEQ ID NO:27through SEQ ID NO:32 or SEQ ID NO:39 through SEQ ID NO:44; only the openreading frame encoding the polypeptides of SEQ ID NO:11 through SEQ IDNO:20 or SEQ ID NO:27 through SEQ ID NO:32 or SEQ ID NO:39 through SEQID NO:44; any alternative (e.g., the mature form without the signalpeptide, or without the 5′ and 3′ sequences outside of the open readingframe, or the sequences as expressed on the surface of a virus (e.g.,influenza)) form of the polypeptides of SEQ ID NO:11-20 or SEQ IDNO:27-32 or SEQ ID NO:39-44; any polypeptide that is encoded by apolynucleotide sequence which hybridizes under highly stringentconditions over substantially the entire length of a polynucleotidesequence of SEQ ID NO:1 through SEQ ID NO:10 or SEQ ID NO:21 through SEQID NO:26, SEQ ID NO:33-38, or SEQ ID NO:45; any polypeptide that isencoded by a polynucleotide sequence which hybridizes under highlystringent conditions to a polynucleotide sequence of SEQ ID NO:1 throughSEQ ID NO:10 or SEQ ID NO:21 through SEQ ID NO:26 or SEQ ID NO:33through SEQ ID NO:38, or SEQ ID NO:45; and, a fragment of any of theabove wherein the sequence comprises a hemagglutinin or neuraminidasepolypeptide, or a fragment of a hemagglutinin or neuraminidasepolypeptide, preferably where the fragments generate an antibody thatspecifically binds a full length polypeptide of the invention. Invarious embodiments, the isolated or recombinant polypeptides of theinvention are substantially identical to about 300 contiguous amino acidresidues of any of the above polypeptides. In yet other embodiments, theinvention comprises isolated or recombinant polypeptides, that comprisean amino acid sequence that is substantially identical over at leastabout 350 amino acids; over at least about 400 amino acids; over atleast about 450 amino acids; over at least about 500 amino acids; overat least about 520 amino acids; over at least about 550 amino acids;over at least about 559 amino acids; over at least about 565 aminoacids; or over at least about 566 amino acids contiguous of any of theabove polypeptides. In some embodiments, the polypeptide sequence (e.g.,as listed in “SEQUENCES” herein) comprises less than 565, 559, etc.amino acids. In such embodiments, the shorter listed polypeptidesoptionally comprise less than 565, 559, etc. amino acids. In yet otherembodiments, the polypeptides of the invention optionally comprisefusion proteins, proteins with a leader sequence, a precursorpolypeptide, proteins with a secretion signal or a localization signal,or proteins with an epitope tag, an E-tag, or a His epitope tag. Instill other embodiments, the invention comprises a polypeptidecomprising a sequence having at least 85%, at least 90%, at least 93%,at least 95%, at least 98%, at least 98.5%, at least 99%, at least99.2%, at least 99.4%, at least 99.6%, at least 99.8%, or at least 99.9%sequence identity to at least one polypeptide listed above. Thehemagglutinin sequences of the invention can comprise both thosesequences with unmodified and modified polybasic cleavage sites (therebyallowing growth of the viruses in eggs). The hemagglutinin polypeptidesequences of SEQ ID NOS:11, 13, 15, 17, 19, 27, 29, 31, 39, 41, or 43comprise the endogenous amino terminal signal peptide sequences,however, the hemagglutinin polypeptide sequences of the invention alsoinclude the mature (amino terminal signal peptide cleaved) form of thehemagglutinin polypeptides. The cleavage sites of any hemagglutininpolypeptide sequence of any influenza strain can be routinely measuredor predicted using any number of methods in the art.

In other aspects, the invention comprises a composition with one or morepolypeptide listed above, or fragments thereof. The invention alsoincludes polypeptides that are specifically bound by a polyclonalantisera raised against at least 1 antigen that comprises at least oneamino acid sequence described above, or a fragment thereof. Suchantibodies specific for the polypeptides described above are alsofeatures of the invention. The polypeptides of the invention areoptionally immunogenic.

The invention also encompasses immunogenic compositions comprising animmunologically effective amount of one or more of any of thepolypeptides described above as well as methods for stimulating theimmune system of an individual to produce a protective immune responseagainst influenza virus by administering to the individual animmunologically effective amount of any of the above polypeptides in aphysiologically acceptable carrier.

Additionally, the invention includes recombinant influenza virus thatcomprises one or more of the polypeptides or polynucleotides above, inaddition to immunogenic compositions comprising an immunologicallyeffective amount of such recombinant influenza virus. Methods forstimulating the immune system of an individual to produce a protectiveimmune response against influenza virus, through administering animmunologically effective amount of such recombinant influenza virus ina physiologically acceptable carrier are also part of the invention.

In other aspects, the invention comprises an isolated or recombinantnucleic acid that is selected from: any one of the polynucleotidesequences SEQ ID NO:1 through SEQ ID NO:10 or SEQ ID NO:21 through SEQID NO:26 or SEQ ID NO:33 through SEQ ID NO:38, or SEQ ID NO:45 (orcomplementary sequences thereof), any one of the polynucleotidesequences encoding polypeptide of SEQ ID NO:11 through SEQ ID NO:20 orSEQ ID NO:27 through SEQ ID NO:32 or SEQ ID NO:39 through SEQ ID NO:44(or complementary polynucleotide sequences thereof), a polynucleotidesequence which hybridizes under highly stringent conditions oversubstantially the entire length of any of the above polynucleotidesequences, and a polynucleotide sequence comprising all or a fragment ofany of such polynucleotide sequences wherein the sequence preferablyencodes a hemagglutinin or neuraminidase polypeptide or a fragment of ahemagglutinin or neuraminidase polypeptide. The invention also includesan isolated or recombinant nucleic acid that encodes an amino acidsequence which is substantially identical over at least about 300 aminoacids of any polypeptide encoded by the above nucleic acids, or over atleast about 350 amino acids; over at least about 400 amino acids; overat least about 450 amino acids; over at least about 500 amino acids;over at least about 502 amino acids; over at least about 550 aminoacids; over at least about 559 amino acids; over at least about 565amino acids; or over at least about 566 amino acids of any polypeptideencoded by the above nucleic acids. Again, in situations wherein theamino acid is less than, e.g., 566, 565, 559, etc. in length (e.g., see,“SEQUENCES”) then it should be understood that the length is optionallyless than 566, 565, 559, etc. The invention also includes any of theabove nucleic acids that comprise a polynucleotide encoding ahemagglutinin or neuraminidase polypeptide, or one or more fragments ofone or more hemagglutinin or neuraminidase polypeptide. Other aspects ofthe invention include isolated or recombinant nucleic acids that encodea polypeptide (optionally a hemagglutinin or neuraminidase polypeptide)whose sequence has at least 98% identity, at least 98.5% identity, atleast 99% identity, at least 99.2% identity, at least 99.4% identity, atleast 99.6% identity, at least 99.8% identity, or at least 99.9%identity to at least one of the above described polynucleotides. Theinvention also includes isolated or recombinant nucleic acids encoding apolypeptide of hemagglutinin or neuraminidase produced by mutating orrecombining one or more above described polynucleotide sequences. Thepolynucleotide sequences of the invention can optionally comprise one ormore of, e.g., a leader sequence, a precursor sequence, or an epitopetag sequence or the like, and can optionally encode a fusion protein(e.g., with one or more additional nucleic acid sequences).

In yet other embodiments, the invention comprises a composition ofmatter having two or more above described nucleic acids (e.g., a librarycomprising at least about 2, 5, 10, 50 or more nucleic acids). Suchcompositions can optionally be produced by cleaving one or more abovedescribed nucleic acid (e.g., mechanically, chemically, enzymaticallywith a restriction endonuclease/RNAse/DNAse, etc.). Other compositionsof the invention include, e.g., compositions produced by incubating oneor more above described nucleic acid in the presence ofdeoxyribonucleotide triphosphates and a thermostable nucleic acidpolymerase.

The invention also encompasses cells comprising at least one of theabove described nucleic acids, or a cleaved or amplified fragment orproduct thereof. Such cells can optionally express a polypeptide encodedby such nucleic acid. Other embodiments of the invention include vectors(e.g., plasmids, cosmids, phage, viruses, virus fragments, etc.)comprising any of above described nucleic acids. Such vectors canoptionally comprise an expression vector. Preferred expression vectorsof the invention include, but are not limited to, vectors comprising polI promoter and terminator sequences or vectors using both the pol I andpol II promoters “the polI/polII promoter system” (e.g., Zobel et al.,Nucl. Acids Res. 1993, 21:3607; US20020164770; Neumann et al., Proc.Natl. Acad. Sci. USA 1999, 96:9345; Fodor et al., J. Virol. 1999,73:9679; and US20030035814). Cells transduced by such vectors are alsowithin the current invention.

In some embodiments, the invention encompasses a virus (e.g., aninfluenza virus) comprising one or more above described nucleic acids(e.g., encoding hemagglutinin and/or neuraminidase), or one or morefragments thereof. Immunogenic compositions comprising such virus arealso part of the current invention. Such viruses can comprises areassortment virus such as a 6:2 reassortment virus (e.g., comprising 6genes encoding regions from one or more donor virus and 2 genes encodingregions from one or more above described nucleotide sequence (or one ormore fragment thereof) which can optionally comprise hemagglutininand/or neuraminidase). Reassortment viruses (optionally live viruses) ofthe invention can include donor viruses that are one or more of, e.g.,cold-sensitive, cold-adapted, or an attenuated. For example,reassortment viruses can comprise e.g., A/Ann Arbor/6/60, PR8, etc.Reassortment viruses of the invention may alternatively exclude A/AnnArbor/6/60. One preferred embodiment of the invention is a reassortantinfluenza virus, wherein the virus is a 6:2 reassortment influenza virusand comprises 6 gene encoding regions from A/Ann Arbor/6/60 and 2 geneencoding regions that encode a polypeptide selected from the groupconsisting of: the polypeptides of SEQ ID NOS:11-20, SEQ ID NOS:27-32,and SEQ ID NOS:39-44. In an alternative embodiment, a reassortantinfluenza virus of the invention includes a 6:2 reassortment influenzavirus, wherein said virus comprises 6 gene encoding regions from one ormore donor viruses other than A/Ann Arbor/6/60 and 2 gene encodingregions that encode a polypeptide selected from the group consisting of:the polypeptides of SEQ ID NOS:11-20, SEQ ID NOS:27-32, and SEQ IDNOS:39-44. In another alternative embodiment, a reassortant influenzavirus of the invention includes a 6:2 reassortment influenza virus,wherein said virus comprises 6 gene encoding regions from one or moredonor viruses other than A/Ann Arbor/6/60 and 2 gene encoding regions,wherein the 2 gene encoding regions are HA or NA polypeptides from anypandemic influenza strain. Methods of producing recombinant influenzavirus through culturing a host cell harboring an influenza virus in asuitable culture medium under conditions permitting expression ofnucleic acid and, isolating the recombinant influenza virus from one ormore of the host cell or the medium are also part of the invention.

In other embodiments herein, the invention comprises immunogeniccompositions having an immunologically effective amount of any of theabove described recombinant influenza virus. Other embodiments includemethods for stimulating the immune system of an individual to produce aprotective immune response against influenza virus by administering tothe individual an immunologically effective amount of any of therecombinant influenza virus described above (optionally in aphysiologically effective carrier).

Other aspects of the invention include methods of producing an isolatedor recombinant polypeptide by culturing any host cell above, in asuitable culture medium under conditions permitting expression ofnucleic acid and, isolating the polypeptide from one or more of the hostcells or the medium in which is the cells are grown.

Immunogenic compositions are also features of the invention. Forexample, immunogenic compositions comprising one or more of any of thepolypeptides and/or nucleic acids described above and, optionally, anexcipient such as a pharmaceutically acceptable excipient or one or morepharmaceutically acceptable administration component. Immunogeniccompositions of the invention can also comprise any one or more abovedescribed virus as well (e.g., along with one or more pharmaceuticallyacceptable administration component).

Methods of producing immunogenic responses in a subject throughadministration of an effective amount of any of the above viruses (orimmunogenic compositions) to a subject are also within the currentinvention. Additionally, methods of prophylactic or therapeutictreatment of a viral infection (e.g., viral influenza) in a subjectthrough administration of any one or more above described virus (orimmunogenic compositions) in an amount effective to produce animmunogenic response against the viral infection are also part of thecurrent invention. Subjects for such treatment can include mammals(e.g., humans). Such methods can also comprise in vivo administration tothe subject as well as in vitro or ex vivo administration to one or morecells of the subject. Additionally, such methods can also compriseadministration of a composition of the virus and a pharmaceuticallyacceptable excipient that are administered to the subject in an amounteffect to prophylactically or therapeutically treat the viral infection.

In other aspects the invention includes compositions of mattercomprising nucleic acid sequences encoding hemagglutinin and/orneuraminidase polypeptides of one or more pandemic influenza strain andnucleic acid sequences encoding one or more polypeptide of A/AnnArbor/6/60. Additionally, the invention includes compositions of mattercomprising nucleic acid sequences encoding hemagglutinin and/orneuraminidase polypeptides of one or more pandemic influenza strain andnucleic acid sequences encoding one or more polypeptide of PR8 or A/AnnArbor/6/60. Such sequences can include those listed in the “SEQUENCES”herein. Additionally, preferred embodiments of the invention includecompositions of matter comprising sequences encoding hemagglutininand/or neuraminidase of one or more pandemic influenza strain andnucleic acid sequences encoding a selected backbone strain in a 6:2reassortment. Such compositions preferably include sequences encodingthe hemagglutinin and neuraminidase selected from the “SEQUENCES” hereinand a backbone strain, wherein the backbone strain is PR8 or A/AnnArbor/6/60. The invention also includes such compositions as describedabove wherein the hemagglutinin comprises a modified polybasic cleavagesite. The invention also includes live attenuated influenza vaccinecomprising such above compositions.

These and other objects and features of the invention will become morefully apparent when the following detailed description is read inconjunction with the accompanying figures and appendix.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1: Shows modifications engineered into the HA gene of VN/1203/2004to remove the polybasic cleavage site. FIG. 1 discloses SEQ ID NOS 46-47and 55-56, respectively, in order of appearance.

FIG. 2: Displays results showing that intranasally administered H5N1 careassortant viruses do not replicate in chickens.

FIG. 3: Illustrates that the H5N1/AA ca vaccine candidates are notlethal to mice.

FIG. 4: Illustrates that the 1997 and 2004 H5N1 ca reassortant virusesare restricted in replication in mice.

FIG. 5: Illustrates that the reassortant H5N1/AA ca influenza virusesare restricted in replication in lungs of mice.

FIG. 6: Shows the serum HAI Ab titers elicited in mice following asingle i.n. dose of vaccine.

FIG. 7: Shows serum neutralizing Ab titers elicited in mice following asingle i.n. dose of vaccine.

FIG. 8: Illustrates that H5N1 ca reassortant viruses protect mice fromlethal challenges with 50, 500 or 5000 LD₅₀ of wild-type H5N1 viruses.

FIG. 9: Illustrates the efficacy of protection from pulmonaryreplication of homologous and heterologous H5N1 challenge viruses inmice.

FIG. 10: Illustrates the efficacy of protection from replication ofhomologous and heterologous H5N1 challenge viruses in the upperrespiratory tract of mice.

FIG. 11: Illustrates the efficacy of protection conferred by 2004 H5N1ca vaccine against high dose (10₅TCID₅₀) challenge with homologous orheterologous H5N1 wt viruses in mice.

FIG. 12: Illustrates the efficacy of protection conferred by 1997 and2003 H5N1 ca vaccines against high dose (10₅TCID₅₀) challenges withhomologous or heterologous H5N1 wild-type viruses in mice.

FIG. 13: Illustrates the efficacy of protection conferred by 2004 H5N1ca vaccine against low or high doses of homologous H5N1 wild-type viruschallenges in mice.

FIG. 14: Shows modifications that can be engineered into the HA gene ofA/Netherland/219/03 HA to remove the polybasic cleavage site. FIG. 14discloses SEQ ID NOS 57-58 and 48-53, respectively, in order ofappearance.

FIG. 15: H5N1 ca vaccines elicit serum neutralizing antibody titers inmice. Sera were collected before (prebleed) and 28 days following eachdose of vaccine; an undetectable titer is assigned a value of 10.

FIG. 16: H6 ca viruses are attenuated in ferrets. *EID₅₀/g for lungs;PFU/g for nasal turbinates. 10⁷ TCID₅₀ inoculated intranasally, tissueswere harvested on day 3 post-infection.

FIG. 17: Immunogenicity of H6 ca vaccines in ferrets.

FIGS. 18( a) and 18(b): Efficacy of H6 ca vaccines in ferrets. Virustiter was measured in (a) lungs and (b) nasal turbinates. Vaccine: 1dose of 7 log₁₀ PFU. Challenge: 7 log₁₀ PFU; 3 days post-challenge.

FIG. 19: H7N3 BC 04 ca is attenuated in ferrets. Inoculum: 10⁷ TCID₅₀ in0.5 mL intranasally. Tissues were harvested on day 3 post-infection.

FIG. 20: H7N3 BC 2004 ca is immunogenic in mice. a: Reciprocal geometricmean of serum neutralizing antibody titers against ck/BC/CN-6/04 wt. b:p<0.05 (Mann-Whitney U-test) compared to neutralization titers at 28days post-infection.

FIG. 21( a)-21(i): H7N3 BC 04 ca is efficacious against lethal challengewith H7 viruses in mice. Efficacy against a lethal challenge of 50 LD₅₀A/ck/BC/CN-7/04: four weeks following immunization with a single dose(a), eight weeks following immunization with a single dose (b), or eightweeks following immunization with 2 doses (2 doses administered at 4weeks apart) (c). Efficacy against a lethal challenge of 50 LD₅₀A/NL/219/03: four weeks following immunization with a single dose (d),eight weeks following immunization with a single dose (e), or eightweeks following immunization with 2 doses (2 doses administered at 4weeks apart) (f). Efficacy against a lethal challenge of 50 LD₅₀A/tk/Eng/63: four weeks following immunization with a single dose (g),eight weeks following immunization with a single dose (h), or eightweeks following immunization with 2 doses (2 doses administered at 4weeks apart) (i).

FIGS. 22( a) and (b): H7N3 BC 04 ca vaccine is efficacious in mice.Virus titer was measured at 8 weeks in (a) nasal turbinates and (b)lungs.

FIGS. 23( a) and (b): H9N2 G9/AA ca is attenuated in ferrets. Virustiter was measured in (a) nasal turbinates and (b) lungs.

FIGS. 24( a) and (b): Efficacy of the H9N2 ca vaccine in mice.

FIG. 25: Replication of H9N2 G9/AA ca is highly restricted in healthyadults.

FIG. 26: HI antibody responses to 10^(7.0) TCID₅₀ of H9N2 G9/AA ca inhealthy adults.

FIG. 27: Replication of H5N1 VN2004 A/AA ca is highly restricted inhealthy adults.

FIG. 28: HI antibody responses to 10^(6.7) TCID₅₀ of VN2004 A/AA ca inhealthy adults.

DETAILED DESCRIPTION

The present invention includes polypeptide and polynucleotide sequencesof influenza hemagglutinin and neuraminidase as well as vectors,compositions and the like comprising such sequences and methods of theiruse. Additional features of the invention are described in more detailherein.

Definitions

Unless defined otherwise, all technical and scientific terms used hereinhave the same meaning as commonly understood by one of ordinary skill inthe art to which the invention pertains. The following definitionssupplement those in the art and are directed to the current applicationand are not necessarily to be imputed to any related or unrelated case,e.g., to any commonly owned patent or application. Although any methodsand materials similar or equivalent to those described herein can beused in the practice for testing of the present invention, the preferredmaterials and methods are described herein. Accordingly, the terminologyused herein is for the purpose of describing particular embodimentsonly, and is not intended to be limiting.

As used in this specification and the appended claims, the singularforms “a,” “an” and “the” include plural referents unless the contextclearly dictates otherwise. Thus, for example, reference to “a virus”includes a plurality of viruses; reference to a “host cell” includesmixtures of host cells, and the like.

An “amino acid sequence” is a polymer of amino acid residues (a protein,polypeptide, etc.) or a character string representing an amino acidpolymer, depending on context.

The terms “nucleic acid,” “polynucleotide,” “polynucleotide sequence”and “nucleic acid sequence” refer to single-stranded or double-strandeddeoxyribonucleotide or ribonucleotide polymers, chimeras or analoguesthereof, or a character string representing such, depending on context.As used herein, the term optionally includes polymers of analogs ofnaturally occurring nucleotides having the essential nature of naturalnucleotides in that they hybridize to single-stranded nucleic acids in amanner similar to naturally occurring nucleotides (e.g., peptide nucleicacids). Unless otherwise indicated, a particular nucleic acid sequenceof this invention optionally encompasses complementary sequences inaddition to the sequence explicitly indicated. From any specifiedpolynucleotide sequence, either the given nucleic acid or thecomplementary polynucleotide sequence (e.g., the complementary nucleicacid) can be determined.

The term “nucleic acid” or “polynucleotide” also encompasses anyphysical string of monomer units that can be corresponded to a string ofnucleotides, including a polymer of nucleotides (e.g., a typical DNA orRNA polymer), PNAs, modified oligonucleotides (e.g., oligonucleotidescomprising bases that are not typical to biological RNA or DNA insolution, such as 2′-O-methylated oligonucleotides), and the like. Anucleic acid can be e.g., single-stranded or double-stranded.

A “subsequence” is any portion of an entire sequence, up to andincluding the complete sequence. Typically, a subsequence comprises lessthan the full-length sequence. A “unique subsequence” is a subsequencethat is not found in any previously determined influenza polynucleotideor polypeptide sequence.

The term “variant” with respect to a polypeptide refers to an amino acidsequence that is altered by one or more amino acids with respect to areference sequence. The variant can have “conservative” changes, whereina substituted amino acid has similar structural or chemical properties,e.g., replacement of leucine with isoleucine. Alternatively, a variantcan have “nonconservative” changes, e.g., replacement of a glycine witha tryptophan. Analogous minor variation can also include amino aciddeletion or insertion, or both. Guidance in determining which amino acidresidues can be substituted, inserted, or deleted without eliminatingbiological or immunological activity can be found using computerprograms well known in the art, for example, DNASTAR software. Examplesof conservative substitutions are also described herein.

The term “gene” is used broadly to refer to any nucleic acid associatedwith a biological function. Thus, genes include coding sequences and/orthe regulatory sequences required for their expression. The term “gene”applies to a specific genomic sequence, as well as to a cDNA or an mRNAencoded by that genomic sequence.

Genes also include non-expressed nucleic acid segments that, forexample, form recognition sequences for other proteins. Non-expressedregulatory sequences include “promoters” and “enhancers,” to whichregulatory proteins such as transcription factors bind, resulting intranscription of adjacent or nearby sequences. A “tissue specific”promoter or enhancer is one that regulates transcription in a specifictissue type or cell type, or types.

“Expression of a gene” or “expression of a nucleic acid” meanstranscription of DNA into RNA (optionally including modification of theRNA, e.g., splicing), translation of RNA into a polypeptide (possiblyincluding subsequent modification of the polypeptide, e.g.,post-translational modification), or both transcription and translation,as indicated by the context.

An “open reading frame” or “ORF” is a possible translational readingframe of DNA or RNA (e.g., of a gene), which is capable of beingtranslated into a polypeptide. That is, the reading frame is notinterrupted by stop codons. However, it should be noted that the termORF does not necessarily indicate that the polynucleotide is, in fact,translated into a polypeptide.

The term “vector” refers to the means by which a nucleic acid can bepropagated and/or transferred between organisms, cells, or cellularcomponents. Vectors include plasmids, viruses, bacteriophages,pro-viruses, phagemids, transposons, artificial chromosomes, and thelike, that replicate autonomously or can integrate into a chromosome ofa host cell. A vector can also be a naked RNA polynucleotide, a nakedDNA polynucleotide, a polynucleotide composed of both DNA and RNA withinthe same strand, a poly-lysine-conjugated DNA or RNA, apeptide-conjugated DNA or RNA, a liposome-conjugated DNA, or the like,that is not autonomously replicating. In many, but not all, commonembodiments, the vectors of the present invention are plasmids.

An “expression vector” is a vector, such as a plasmid that is capable ofpromoting expression, as well as replication of a nucleic acidincorporated therein. Typically, the nucleic acid to be expressed is“operably linked” to a promoter and/or enhancer, and is subject totranscription regulatory control by the promoter and/or enhancer.

A “bi-directional expression vector” is characterized by two alternativepromoters oriented in the opposite direction relative to a nucleic acidsituated between the two promoters, such that expression can beinitiated in both orientations resulting in, e.g., transcription of bothplus (+) or sense strand, and negative (−) or antisense strand RNAs.

A “polypeptide” is a polymer comprising two or more amino acid residues(e.g., a peptide or a protein). The polymer can optionally comprisemodifications such as glycosylation or the like. The amino acid residuesof the polypeptide can be natural or non-natural and can beunsubstituted, unmodified, substituted or modified.

In the context of the invention, the term “isolated” refers to abiological material, such as a virus, a nucleic acid or a protein, whichis substantially free from components that normally accompany orinteract with it in its naturally occurring environment. The isolatedbiological material optionally comprises additional material not foundwith the biological material in its natural environment, e.g., a cell orwild-type virus. For example, if the material is in its naturalenvironment, such as a cell, the material can have been placed at alocation in the cell (e.g., genome or genetic element) not native tosuch material found in that environment. For example, a naturallyoccurring nucleic acid (e.g., a coding sequence, a promoter, anenhancer, etc.) becomes isolated if it is introduced by non-naturallyoccurring means to a locus of the genome (e.g., a vector, such as aplasmid or virus vector, or amplicon) not native to that nucleic acid.Such nucleic acids are also referred to as “heterologous” nucleic acids.An isolated virus, for example, is in an environment (e.g., a cellculture system, or purified from cell culture) other than the nativeenvironment of wild-type virus (e.g., the nasopharynx of an infectedindividual).

The term “chimeric” or “chimera,” when referring to a virus, indicatesthat the virus includes genetic and/or polypeptide components derivedfrom more than one parental viral strain or source. Similarly, the term“chimeric” or “chimera,” when referring to a viral protein, indicatesthat the protein includes polypeptide components (i.e., amino acidsubsequences) derived from more than one parental viral strain orsource.

The term “recombinant” indicates that the material (e.g., a nucleic acidor protein) has been artificially or synthetically (non-naturally)altered by human intervention. The alteration can be performed on thematerial within, or removed from, its natural environment or state.Specifically, e.g., an influenza virus is recombinant when it isproduced by the expression of a recombinant nucleic acid. For example, a“recombinant nucleic acid” is one that is made by recombining nucleicacids, e.g., during cloning, DNA shuffling or other procedures, or bychemical or other mutagenesis; a “recombinant polypeptide” or“recombinant protein” is a polypeptide or protein which is produced byexpression of a recombinant nucleic acid; and a “recombinant virus”,e.g., a recombinant influenza virus, is produced by the expression of arecombinant nucleic acid.

The term “reassortant,” when referring to a virus, indicates that thevirus includes genetic and/or polypeptide components derived from morethan one parental viral strain or source. For example, a 7:1 reassortantincludes 7 viral genomic segments (or gene segments) derived from afirst parental virus, and a single complementary viral genomic segment,e.g., encoding a hemagglutinin or neuraminidase of the invention. A 6:2reassortant includes 6 genomic segments, most commonly the 6 internalgenes from a first parental virus, and two complementary segments, e.g.,hemagglutinin and neuraminidase, from a different parental virus.

The term “introduced” when referring to a heterologous or isolatednucleic acid refers to the incorporation of a nucleic acid into aeukaryotic or prokaryotic cell where the nucleic acid can beincorporated into the genome of the cell (e.g., chromosome, plasmid,plastid or mitochondrial DNA), converted into an autonomous replicon, ortransiently expressed (e.g., transfected mRNA). The term includes suchmethods as “infection,” “transfection,” “transformation” and“transduction.” In the context of the invention a variety of methods canbe employed to introduce nucleic acids into cells, includingelectroporation, calcium phosphate precipitation, lipid mediatedtransfection (lipofection), etc.

The term “host cell” means a cell that contains a heterologous nucleicacid, such as a vector, and supports the replication and/or expressionof the nucleic acid. Host cells can be prokaryotic cells such as E.coli, or eukaryotic cells such as yeast, insect, amphibian, avian ormammalian cells, including human cells. Exemplary host cells caninclude, e.g., Vero (African green monkey kidney) cells, BHK (babyhamster kidney) cells, primary chick kidney (PCK) cells, Madin-DarbyCanine Kidney (MDCK) cells, Madin-Darby Bovine Kidney (MDBK) cells, 293cells (e.g., 293T cells), and COS cells (e.g., COS1, COS7 cells), etc.

An “immunologically effective amount” of influenza virus is an amountsufficient to enhance an individual's (e.g., a human's) own immuneresponse against a subsequent exposure to influenza virus. Levels ofinduced immunity can be monitored, e.g., by measuring amounts ofneutralizing secretory and/or serum antibodies, e.g., by plaqueneutralization, complement fixation, enzyme-linked immunosorbent, ormicroneutralization assay.

A “protective immune response” against influenza virus refers to animmune response exhibited by an individual (e.g., a human) that isprotective against disease when the individual is subsequently exposedto and/or infected with such influenza virus. In some instances, theinfluenza virus (e.g., naturally circulating) can still cause infection,but it cannot cause a serious infection. Typically, the protectiveimmune response results in detectable levels of host engendered serumand secretory antibodies that are capable of neutralizing virus of thesame strain and/or subgroup (and possibly also of a different,non-vaccine strain and/or subgroup) in vitro and in vivo.

As used herein, an “antibody” is a protein comprising one or morepolypeptides substantially or partially encoded by immunoglobulin genesor fragments of immunoglobulin genes. The recognized immunoglobulingenes include the kappa, lambda, alpha, gamma, delta, epsilon and muconstant region genes, as well as myriad immunoglobulin variable regiongenes. Light chains are classified as either kappa or lambda. Heavychains are classified as gamma, mu, alpha, delta, or epsilon, which inturn define the immunoglobulin classes, IgG, IgM, IgA, IgD and IgE,respectively. A typical immunoglobulin (antibody) structural unitcomprises a tetramer. Each tetramer is composed of two identical pairsof polypeptide chains, each pair having one “light” (about 25 kD) andone “heavy” chain (about 50-70 kD). The N-terminus of each chain definesa variable region of about 100 to 110 or more amino acids primarilyresponsible for antigen recognition. The terms variable light chain (VL)and variable heavy chain (VH) refer to these light and heavy chainsrespectively. Antibodies exist as intact immunoglobulins or as a numberof well-characterized fragments produced by digestion with variouspeptidases. Thus, for example, pepsin digests an antibody below thedisulfide linkages in the hinge region to produce F(ab)′2, a dimer ofFab which itself is a light chain joined to VH—CH 1 by a disulfide bond.The F(ab)′2 may be reduced under mild conditions to break the disulfidelinkage in the hinge region thereby converting the (Fab′)2 dimer into aFab′ monomer. The Fab′ monomer is essentially a Fab with part of thehinge region (see, Fundamental Immunology, W. E. Paul, ed., Raven Press,N.Y. (1999), for a more detailed description of other antibodyfragments). While various antibody fragments are defined in terms of thedigestion of an intact antibody, one of skill will appreciate that suchFab′ fragments may be synthesized de novo either chemically or byutilizing recombinant DNA methodology. Thus, the term antibody, as usedherein, includes antibodies or fragments either produced by themodification of whole antibodies or synthesized de novo usingrecombinant DNA methodologies. Antibodies include, e.g., polyclonalantibodies, monoclonal antibodies, multiple or single chain antibodies,including single chain Fv (sFv or scFv) antibodies in which a variableheavy and a variable light chain are joined together (directly orthrough a peptide linker) to form a continuous polypeptide, andhumanized or chimeric antibodies.

Influenza Virus

The polypeptides and polynucleotides of the invention, e.g., SEQ ID NO:1-45, are variants of influenza HA and NA sequences. In general,influenza viruses are made up of an internal ribonucleoprotein corecontaining a segmented single-stranded RNA genome and an outerlipoprotein envelope lined by a matrix protein. The genome of influenzaviruses is composed of eight segments of linear (−) strand ribonucleicacid (RNA), encoding the immunogenic hemagglutinin (HA) andneuraminidase (NA) proteins, and six internal core polypeptides: thenucleocapsid nucleoprotein (NP); matrix proteins (M); non-structuralproteins (NS); and 3 RNA polymerase (PA, PB1, PB2) proteins. Duringreplication, the genomic viral RNA is transcribed into (+) strandmessenger RNA and (−) strand genomic cRNA in the nucleus of the hostcell. Each of the eight genomic segments is packaged intoribonucleoprotein complexes that contain, in addition to the RNA, NP anda polymerase complex (PB 1, PB2, and PA).

Influenza is commonly grouped into influenza A and influenza Bcategories. Influenza A and influenza B viruses each contain eightsegments of single stranded RNA with negative polarity. The influenza Agenome encodes eleven polypeptides. Segments 1-3 encode threepolypeptides, making up a RNA-dependent RNA polymerase. Segment 1encodes the polymerase complex protein PB2. The remaining polymeraseproteins PB 1 and PA are encoded by segment 2 and segment 3,respectively. In addition, segment 1 of some influenza strains encodes asmall protein, PB 1-F2, produced from an alternative reading framewithin the PB 1 coding region. Segment 4 encodes the hemagglutinin (HA)surface glycoprotein involved in cell attachment and entry duringinfection. Segment 5 encodes the nucleocapsid nucleoprotein (NP)polypeptide, the major structural component associated with viral RNA.Segment 6 encodes a neuraminidase (NA) envelope glycoprotein. Segment 7encodes two matrix proteins, designated M1 and M2, which are translatedfrom differentially spliced mRNAs. Segment 8 encodes NS1 and NS2, twononstructural proteins, which are translated from alternatively splicedmRNA variants. The eight genome segments of influenza B encode 11proteins. The three largest genes code for components of the RNApolymerase, PB1, PB2 and PA. Segment 4 encodes the HA protein. Segment 5encodes NP. Segment 6 encodes the NA protein and the NB protein. Bothproteins, NB and NA, are translated from overlapping reading frames of abicistronic mRNA. Segment 7 of influenza B also encodes two proteins: M1and BM2. The smallest segment encodes two products: NS1 is translatedfrom the full length RNA, while NS2 is translated from a spliced mRNAvariant.

Influenza Virus Vaccines

The sequences, compositions and methods herein are primarily, but notsolely, concerned with production of influenza viruses for vaccines.Historically, influenza virus vaccines have primarily been produced inembryonated hen eggs using strains of virus selected or based onempirical predictions of relevant strains. More recently, reassortantviruses have been produced that incorporate selected hemagglutininand/or neuraminidase antigens in the context of an approved attenuated,temperature sensitive master strain. Following culture of the virusthrough multiple passages in hen eggs, influenza viruses are recoveredand, optionally, inactivated, e.g., using formaldehyde and/orβ-propiolactone (or alternatively used in live attenuated vaccines).Thus, it will be appreciated that HA and NA sequences (e.g., SEQ ID NO:1-45) are quite useful in constructing influenza vaccines. The currentinvention includes viruses/vaccines comprising HA and/or NA sequences ofpandemic influenza strains (including wherein the HA sequences comprisemodified polybasic cleavage sites such as the modifications describedherein); and including wherein the viruses/vaccines comprise a cabackbone such as A/AA/6/60 or the backbone of PR8.

Attempts at producing recombinant and reassortant vaccines in cellculture have been hampered by the inability of some of the strainsapproved for vaccine production to grow efficiently under standard cellculture conditions. However, prior work by the inventors and theircoworkers provided a vector system, and methods for producingrecombinant and reassortant viruses in culture, thus, making it possibleto rapidly produce vaccines corresponding to one or many selectedantigenic strains of virus, e.g., either A or B strains, varioussubtypes or substrains, etc., e.g., comprising the HA and/or NAsequences herein. See, Multi-Plasmid System for the production ofInfluenza virus, U.S. Application No. 60/420,708, filed Oct. 23, 2002,U.S. application Ser. No. 10/423,828, filed Apr. 25, 2003 and U.S.Application 60/574,117 filed May 24, 2004. Typically, the cultures aremaintained in a system, such as a cell culture incubator, undercontrolled humidity and CO₂, at constant temperature using a temperatureregulator, such as a thermostat to insure that the temperature does notexceed 35° C. Reassortant influenza viruses can be readily obtained byintroducing a subset of vectors corresponding to genomic segments of amaster influenza virus, in combination with complementary segmentsderived from strains of interest (e.g., HA and/or NA antigenic variantsherein). Typically, the master strains are selected on the basis ofdesirable properties relevant to vaccine administration. For example,for vaccine production, e.g., for production of a live attenuatedvaccine, the master donor virus strain may be selected for an attenuatedphenotype, cold adaptation and/or temperature sensitivity. As explainedelsewhere herein and, e.g., in U.S. patent application Ser. No.10/423,828, etc., various embodiments of the invention utilize A/AnnArbor (AA)/6/60 influenza strain as a “backbone” upon which to add HAand/or NA genes (e.g., such as those sequences listed herein, etc.) tocreate desired reassortant viruses. Thus, for example, in a 6:2reassortant, 2 genes (i.e., NA and HA) would be from the influenzastrain(s) against which an immunogenic reaction is desired, while theother 6 genes would be from the Ann Arbor strain, or other backbonestrain, etc. The Ann Arbor virus is useful for its cold adapted,attenuated, temperature sensitive attributes. Of course, it will beappreciated that the HA and NA sequences herein are capable ofreassortment with a number of other virus genes or virus types (e.g., anumber of different “backbones” such as PR8, etc., containing the otherinfluenza genes present in a reassortant, namely, the non-HA and non-NAgenes

Various embodiments herein can comprise live attenuated vaccines, havingthe HA and/or NA sequences herein, for pandemic influenza. Such vaccinestypically comprise, e.g., the HA and/or NA sequences of SEQ ID NO:11-20, 27-32, or 39-44, or their corresponding encoding nucleotides ofSEQ ID NO: 1-10, 21-26, 33-38, or 45. One problem arising from growth ofvaccine virus strains (e.g., reassortants) in eggs is that avian strains(which can be involved in pandemics) can kill the eggs in which thevaccines are to be produced and are, thus, hard to manipulate, produce,etc. through use of traditional (non-plasmid rescue) reassortantproduction. Such avian strains are of interest since evidence indicatesthey can result in influenza in humans and possible pandemics. Thus, useof plasmid-rescue systems to create/manipulate influenza reassortantswith pandemic strains such as various avian sequences (e.g., the HA andNA sequences herein) are quite desirable and are features of theinvention. It will be appreciated, however, that the current sequencesare also capable of use with non-plasmid or traditional systems.

Aquatic birds (among others) can be infected by influenza A viruses of15 hemagglutinin (HA) and 9 neuraminidase (NA) subtypes. Such birds canserve as a reservoir from which novel influenza subtypes can beintroduced into human populations and cause pandemics. The observationthat avian H7N7 influenza A viruses infected humans in The Netherlandsin 2003 and avian H5N1 and H9N2 viruses infected humans in Hong Kong andChina earlier, raise concerns that these (and other) subtypes have thepotential to cause pandemics. Thus, vaccines are needed to prevent humaninfections with avian influenza A viruses. Live, attenuated influenza Avirus vaccines against human influenza viruses were recently licensed inthe United States. See above. Such vaccines are reassortant H1N1 andH3N2 viruses in which the internal protein genes of A/Ann Arbor(AA)/6/60 (H2N2) cold adapted (ca) virus confer the cold adapted,attenuation and temperature sensitive phenotypes of the AA ca virus onthe reassortant viruses (i.e., the ones having the hemagglutinin andneuraminidase genes from the non-Ann Arbor strain). Classical geneticreassortment and plasmid-based reverse genetics techniques have beenapplied to generate reassortant viruses that contain the hemagglutininand neuraminidase genes from avian influenza A viruses (H4-H14 subtypes)and six internal gene segments from the AA ca virus. Such reassortantviruses are features of the invention. See the HA and NA gene sequencesbelow. These viruses bear biological properties that are desirable incandidate vaccines because the phenotypes associated with the AA cavirus are present in the reassortant viruses. The generation andevaluation of these reassortant viruses as seed viruses for vaccines areimportant steps in pandemic preparedness. It is contemplated thatclinical trials can establish the safety, infectivity and immunogenicityof such live attenuated pandemic vaccines. Other embodiments of theinvention include reassortant viruses (e.g., those used in vaccines)comprising pandemic antigenic genes HA and/or NA from, e.g., avian,porcine, etc., pandemic virus strains in addition to those listedherein, to produce pandemic vaccines which are created throughplasmid-rescue reassortment (e.g., reassortment with A/Ann Arbor 6/60(i.e., A/AA/6/60), PR8, etc. Methods of construction and use of suchviruses and vaccines are also included. “Pandemic virus strains” as usedherein is defined as an influenza strain A virus subtype that it is notcirculating in the human population, that is declared to be a pandemicstrain by the Centers for Disease Control or the World HealthOrganization or generally acknowledged as such within the scientificcommunity.

In various embodiments herein, the antigenic sequences (e.g., the HAsequences) as well as viruses and vaccines from such viruses comprisemodified polybasic cleavage sites. Some highly pathogenic avian pandemicinfluenza strains comprise multiple basic amino acid cleavage siteswithin hemagglutinin sequences. See, e.g., Li et al., J. of InfectiousDiseases, 179:1132-8, 1999. Such cleavage sites, in typical embodimentsherein, are, e.g., modified or altered in their sequences in comparisonto the wild-type sequences from which the current sequences are derived(e.g., to disable the cleavage or reduce the cleavage there, etc.). Suchmodifications/alterations can be different in different strains due tothe various sequences of the cleavage sites in the wild-type sequences.For example, 4 polybasic residues (RRKK) (SEQ ID NO: 54) at 326-329 ofmature H5 are typically removed in sequences herein (as compared to wt).See “SEQUENCES.” In various embodiments, the polybasic cleavage sitescan be modified in a number of ways (all of which are contained withinthe invention). For example, the polybasic cleavage site can be removedone amino acid at a time (e.g., one R removed, two Rs removed, RRKremoved, or RRKK (SEQ ID NO: 54) removed). Additionally, the amino acidresidue directly upstream of the cleavage site can also be removed oraltered (e.g., from an R to a T, etc.); also, the nucleotides encodingthe amino acid residue directly after the cleavage site can also bemodified. See, e.g., FIG. 1 for an illustration of cleavage sitemodification. In addition, hemagglutinin polypeptide sequences ofinfluenza virus comprise amino terminal signal peptide sequences, thus,the hemagglutinin polypeptide sequences of the invention include boththe mature (amino terminal signal peptide cleaved) form of thehemagglutinin polypeptides and the pre-cleaved form of hemagglutinin.The cleavage sites of any hemagglutinin polypeptide sequence of anyinfluenza strain can be routinely measured or predicted using any numberof methods in the art.

The terms “temperature sensitive,” “cold adapted” and “attenuated” asapplied to viruses (typically used as vaccines or for vaccineproduction) which optionally encompass the current sequences, are wellknown in the art. For example, the term “temperature sensitive” (ts)indicates, e.g., that the virus exhibits a 100 fold or greater reductionin titer at 39° C. relative to 33° C. for influenza A strains, or thatthe virus exhibits a 100 fold or greater reduction in titer at 37° C.relative to 33° C. for influenza B strains. The term “cold adapted” (ca)indicates that the virus exhibits growth at 25° C. within 100 fold ofits growth at 33° C., while the term “attenuated” (att) indicates thatthe virus replicates in the upper airways of ferrets but is notdetectable in their lung tissues, and does not cause influenza-likeillness in the animal. It will be understood that viruses withintermediate phenotypes, i.e., viruses exhibiting titer reductions lessthan 100 fold at 39° C. (for A strain viruses) or 37° C. (for B strainviruses), or exhibiting growth at 25° C. that is more than 100 fold thanits growth at 33° C. (e.g., within 200 fold, 500 fold, 1000 fold, 10,000fold less), and/or exhibit reduced growth in the lungs relative togrowth in the upper airways of ferrets (i.e., partially attenuated)and/or reduced influenza like illness in the animal, are also usefulviruses and can be used in conjunction with the HA and NA sequencesherein.

Again, the HA and NA sequences of the current invention are optionallyutilized in the production of or in reassortment vaccines (and/or inother ts, cs, ca, and/or att viruses and vaccines). However, it shouldbe noted that the HA and NA sequences, etc. of the invention are notlimited to specific vaccine compositions or production methods, and can,thus, be utilized in substantially any vaccine type or vaccineproduction method which utilizes strain specific HA and NA antigens(e.g., any of SEQ ID NO: 11-20, 27-32, or 39-44 or the correspondingnucleotides encoding the specific HA and NA antigens, e.g., SEQ ID NO:1-10, 21-26, 33-38, or 45).

FluMist™

As mentioned previously, numerous examples and types of influenzavaccine exist. An exemplary influenza vaccine is FluMist™ which is alive, attenuated vaccine that protects children and adults frominfluenza illness (Belshe et al. (1998) The efficacy of live attenuated,cold-adapted, trivalent, intranasal influenza virus vaccine in childrenN Engl J Med 338:1405-12; Nichol et al. (1999) Effectiveness of live,attenuated intranasal influenza virus vaccine in healthy, workingadults: a randomized controlled trial JAMA 282:137-44). In typical, andpreferred, embodiments, the methods and compositions of the currentinvention are preferably adapted to/used with production of FluMist™vaccine. However, it will be appreciated by those skilled in the artthat the sequences, methods, compositions, etc. herein are alsoadaptable to production of similar or even different viral vaccines.

FluMist™ vaccine strains contain, e.g., HA and NA gene segments derivedfrom the strains (e.g., wild-type strains) to which the vaccine isaddressed along with six gene segments, PB1, PB2, PA, NP, M and NS, froma common master donor virus (MDV). The HA sequences herein, thus, arepart of various FluMist™ formulations. The MDV for influenza A strainsof FluMist™ (MDV-A), was created by serial passage of the wild-typeA/Ann Arbor/6/60 (A/AA/6/60) strain in primary chicken kidney tissueculture at successively lower temperatures (Maassab (1967) Adaptationand growth characteristics of influenza virus at 25 degrees C. Nature213:612-4). MDV-A replicates efficiently at 25° C. (ca, cold adapted),but its growth is restricted at 38 and 39° C. (ts, temperaturesensitive). Additionally, this virus does not replicate in the lungs ofinfected ferrets (att, attenuation). The ts phenotype is believed tocontribute to the attenuation of the vaccine in humans by restrictingits replication in all but the coolest regions of the respiratory tract.The stability of this property has been demonstrated in animal modelsand clinical studies. In contrast to the ts phenotype of influenzastrains created by chemical mutagenesis, the ts property of MDV-A doesnot revert following passage through infected hamsters or in shedisolates from children (for a recent review, see Murphy & Coelingh(2002) Principles underlying the development and use of live attenuatedcold-adapted influenza A and B virus vaccines Viral Immunol 15:295-323).

Clinical studies in over 20,000 adults and children involving 12separate 6:2 reassortant strains have shown that these vaccines areattenuated, safe and efficacious (Belshe et al. (1998) The efficacy oflive attenuated, cold-adapted, trivalent, intranasal influenza virusvaccine in children N Engl J Med 338:1405-12; Boyce et al. (2000) Safetyand immunogenicity of adjuvanted and unadjuvanted subunit influenzavaccines administered intranasally to healthy adults Vaccine 19:217-26;Edwards et al. (1994) A randomized controlled trial of cold adapted andinactivated vaccines for the prevention of influenza A disease J InfectDis 169:68-76; Nichol et al. (1999) Effectiveness of live, attenuatedintranasal influenza virus vaccine in healthy, working adults: arandomized controlled trial JAMA 282: 137-44). Reassortants carrying thesix internal genes of MDV-A and the two HA and NA gene segments of awild-type virus (i.e., a 6:2 reassortant) consistently maintain ca, tsand att phenotypes (Maassab et al. (1982) Evaluation of acold-recombinant influenza virus vaccine in ferrets J. Infect. Dis.146:780-900).

Production of such reassorted virus using B strains of influenza is moredifficult, however, recent work (see, e.g., Multi-Plasmid System for theProduction of Influenza Virus, U.S. Application No. 60/420,708, filedOct. 23, 2002, U.S. application Ser. No. 10/423,828, filed Apr. 25,2003, and U.S. Application No. 60/574,117, filed May 24, 2004) has shownan eight plasmid system for the generation of influenza B virus entirelyfrom cloned cDNA. Methods for the production of attenuated liveinfluenza A and B virus suitable for vaccine formulations, such as livevirus vaccine formulations useful for intranasal administration werealso shown.

The system and methods described previously are useful for the rapidproduction in cell culture of recombinant and reassortant influenza Aand B viruses, including viruses suitable for use as vaccines, includinglive attenuated vaccines, such as vaccines suitable for intranasaladministration. The sequences (e.g., nucleotide sequences SEQ ID NO:1-10, 21-26, 33-38, or 45 and the corresponding amino acids encoded bythe nucleotide sequences in SEQ ID NO: 11-20, 27-32, or 39-44), methods,etc. of the current invention, are optionally used in conjunction with,or in combination with, such previous work involving, e.g., reassertedinfluenza viruses for vaccine production to produce viruses forvaccines.

Methods and Compositions for Prophylactic Administration of Vaccines

As stated above, alternatively, or in addition to, use in production ofFluMist™ vaccine, the current invention can be used in other vaccineformulations. In general, recombinant and reassortant viruses of theinvention (e.g., those comprising polynucleotides of SEQ ID NO:1-10, 21,23-26, 33-38, or 45 or polypeptides of SEQ ID NO:11-20, 27-32, or 39-44,or fragments thereof) can be administered prophylactically in animmunologically effective amount and in an appropriate carrier orexcipient to stimulate an immune response specific for one or morestrains of influenza virus as determined by the HA and/or NA sequence.Typically, the carrier or excipient is a pharmaceutically acceptablecarrier or excipient, such as sterile water, aqueous saline solution,aqueous buffered saline solutions, aqueous dextrose solutions, aqueousglycerol solutions, ethanol, or combinations thereof. The preparation ofsuch solutions insuring sterility, pH, isotonicity, and stability iseffected according to protocols established in the art. Generally, acarrier or excipient is selected to minimize allergic and otherundesirable effects, and to suit the particular route of administration,e.g., subcutaneous, intramuscular, intranasal, etc.

A related aspect of the invention provides methods for stimulating theimmune system of an individual to produce a protective immune responseagainst influenza virus. In the methods, an immunologically effectiveamount of a recombinant influenza virus (e.g., comprising an HA and/orNA molecule of the invention), an immunologically effective amount of apolypeptide of the invention, and/or an immunologically effective amountof a nucleic acid of the invention is administered to the individual ina physiologically acceptable carrier.

Generally, the influenza viruses of the invention are administered in aquantity sufficient to stimulate an immune response specific for one ormore strains of influenza virus (i.e., against the HA and/or NA strainsof the invention). Preferably, administration of the influenza viruseselicits a protective immune response to such strains. Dosages andmethods for eliciting a protective immune response against one or moreinfluenza strains are known to those of skill in the art. See, e.g.,U.S. Pat. No. 5,922,326; Wright et al., Infect. Immun. 37:397-400(1982); Kim et al., Pediatrics 52:56-63 (1973); and Wright et al., J.Pediatr. 88:931-936 (1976). For example, influenza viruses are providedin the range of about 1-1000 HID₅₀ (human infectious dose), i.e., about10⁵-10⁸ pfu (plaque forming units) per dose administered. Typically, thedose will be adjusted within this range based on, e.g., age, physicalcondition, body weight, sex, diet, time of administration, and otherclinical factors. The prophylactic vaccine formulation is systemicallyadministered, e.g., by subcutaneous or intramuscular injection using aneedle and syringe, or a needle-less injection device. Alternatively,the vaccine formulation is administered intranasally, either by drops,large particle aerosol (greater than about 10 microns), or spray intothe upper respiratory tract. While any of the above routes of deliveryresults in a protective systemic immune response, intranasaladministration confers the added benefit of eliciting mucosal immunityat the site of entry of the influenza virus. For intranasaladministration, attenuated live virus vaccines are often preferred,e.g., an attenuated, cold adapted and/or temperature sensitiverecombinant or reassortant influenza virus. See above. While stimulationof a protective immune response with a single dose is preferred,additional dosages can be administered, by the same or different route,to achieve the desired prophylactic effect.

Typically, the attenuated recombinant influenza of this invention asused in a vaccine is sufficiently attenuated such that symptoms ofinfection, or at least symptoms of serious infection, will not occur inmost individuals immunized (or otherwise infected) with the attenuatedinfluenza virus. In some instances, the attenuated influenza virus canstill be capable of producing symptoms of mild illness (e.g., mild upperrespiratory illness) and/or of dissemination to unvaccinatedindividuals. However, its virulence is sufficiently abrogated such thatsevere lower respiratory tract infections do not occur in the vaccinatedor incidental host.

Alternatively, an immune response can be stimulated by ex vivo or invivo targeting of dendritic cells with influenza viruses comprising thesequences herein. For example, proliferating dendritic cells are exposedto viruses in a sufficient amount and for a sufficient period of time topermit capture of the influenza antigens by the dendritic cells. Thecells are then transferred into a subject to be vaccinated by standardintravenous transplantation methods.

While stimulation of a protective immune response with a single dose ispreferred, additional dosages can be administered, by the same ordifferent route, to achieve the desired prophylactic effect. In neonatesand infants, for example, multiple administrations may be required toelicit sufficient levels of immunity. Administration can continue atintervals throughout childhood, as necessary to maintain sufficientlevels of protection against wild-type influenza infection. Similarly,adults who are particularly susceptible to repeated or serious influenzainfection, such as, for example, health care workers, day care workers,family members of young children, the elderly, and individuals withcompromised cardiopulmonary function may require multiple immunizationsto establish and/or maintain protective immune responses. Levels ofinduced immunity can be monitored, for example, by measuring amounts ofneutralizing secretory and serum antibodies, and dosages adjusted orvaccinations repeated as necessary to elicit and maintain desired levelsof protection.

Optionally, the formulation for prophylactic administration of theinfluenza viruses also contains one or more adjuvants for enhancing theimmune response to the influenza antigens. Suitable adjuvants include:complete Freund's adjuvant, incomplete Freund's adjuvant, saponin,mineral gels such as aluminum hydroxide, surface active substances suchas lysolecithin, pluronic polyols, polyanions, peptides, oil orhydrocarbon emulsions, bacille Calmette-Guerin (BCG), Corynebacteriumparvum, and the synthetic adjuvant QS-21.

If desired, prophylactic vaccine administration of influenza viruses canbe performed in conjunction with administration of one or moreimmunostimulatory molecules. Immunostimulatory molecules include variouscytokines, lymphokines and chemokines with immunostimulatory,immunopotentiating, and pro-inflammatory activities, such asinterleukins (e.g., IL-1, IL-2, IL-3, IL-4, IL-12, IL-13); growthfactors (e.g., granulocyte-macrophage (GM)-colony stimulating factor(CSF)); and other immunostimulatory molecules, such as macrophageinflammatory factor, Flt3 ligand, B7.1; B7.2, etc. The immunostimulatorymolecules can be administered in the same formulation as the influenzaviruses, or can be administered separately. Either the protein (e.g., anHA and/or NA polypeptide of the invention, e.g., any of SEQ ID NO:11-20, 27-32, or 39-44) or an expression vector comprising a nucleicacid (e.g., any of SEQ ID NO: 1-10, 21-26, 33-38, or 45) encoding theprotein can be administered to produce an immunostimulatory effect.

The above described methods are useful for therapeutically and/orprophylactically treating a disease or disorder, typically influenza, byintroducing a vector of the invention comprising a heterologouspolynucleotide encoding a therapeutically or prophylactically effectiveHA and/or NA polypeptide (or peptide) or HA and/or NA RNA (e.g., anantisense RNA or ribozyme) into a population of target cells in vitro,ex vivo or in vivo. Typically, the polynucleotide encoding thepolypeptide (or peptide), or RNA, of interest is operably linked toappropriate regulatory sequences as described above in the sectionsentitled “Expression Vectors” and “Additional Expression Elements.”Optionally, more than one heterologous coding sequence is incorporatedinto a single vector or virus. For example, in addition to apolynucleotide encoding a therapeutically or prophylactically active HAand/or NA polypeptide or RNA, the vector can also include additionaltherapeutic or prophylactic polypeptides, e.g., antigens, co-stimulatorymolecules, cytokines, antibodies, etc., and/or markers, and the like.

Although vaccination of an individual with an attenuated influenza virusof a particular strain of a particular subgroup can inducecross-protection against influenza virus of different strains and/orsubgroups, cross-protection can be enhanced, if desired, by vaccinatingthe individual with attenuated influenza virus from at least twostrains, e.g., each of which represents a different subgroup.Additionally, vaccine combinations can optionally include mixes ofpandemic vaccines (e.g., those against pandemic influenza strains suchas various avian strains, see, e.g., the sequences herein, or otherpandemic strains) and non-pandemic strains. Vaccine mixtures (ormultiple vaccinations) can comprise components from human strains and/ornon-human influenza strains (e.g., avian and human, etc.). Similarly,the attenuated influenza virus vaccines of this invention can optionallybe combined with vaccines that induce protective immune responsesagainst other infectious agents.

Polynucleotides of the Invention

It is well known in the art that the HA and NA polynucleotide segmentsof influenza viruses comprise both a coding region (encoding the ORF)and noncoding regions (NCRs), both 5′ and 3′ of the HA and NA codingsequence. An example of these NCRs are shown in SEQ ID NOS:1-9 (outsideof the ORFs). It is also known that primers can be made to these NCRs tofacilitate amplification of the entire HA and NA segments of influenzavirus. (see, e.g., Hoffmann et al. Arch Virol. 2001 December;146(12):2275-89). Further, it is known that the NCRs of the HA and NA ofinfluenza may increase the efficiency of achieving reassortants.Therefore, the polynucleotide sequences of these NCRs (includingfragments and variants (e.g., at least about 60%, or at least 70%, or atleast 80%, or at least 90%, or at least about 91% or at least about 92%,or at least about 93%, or at least about 94%, or at least about 95%, orat least about 96%, or at least about 97%, or at least about 98%, or atleast about 98.5%, or at least about 98.7%, or at least about 99%, or atleast about 99.1%, or at least about 99.2%, or at least about 99.3%, orat least about 99.4%, or at least about 99.5%, or at least about 99.6%or at least about 99.7%, or at least about 99.8%, or at least about99.9% identity) thereof) are within the scope of this invention. Whenamplifying the HA and NA segments of any pandemic strain, one could makeand use polynucleotide primers to bind conserved (e.g., among relatedstrains) regions of the HA and NA NCRs for amplification (e.g., byRT-PCR). In one embodiment, HA and NA polynucleotides of the inventioninclude both the NCR and ORF of the HA and NA sequences (includingfragments and variants (e.g., at least about 60%, or at least 70%, or atleast 80%, or at least 90%, or at least about 91% or at least about 92%,or at least about 93%, or at least about 94%, or at least about 95%, orat least about 96%, or at least about 97%, or at least about 98%, or atleast about 98.5%, or at least about 98.7%, or at least about 99%, or atleast about 99.1%, or at least about 99.2%, or at least about 99.3%, orat least about 99.4%, or at least about 99.5%, or at least about 99.6%or at least about 99.7%, or at least about 99.8%, or at least about99.9%) thereof) of pandemic virus strains. In alternative embodiments,the HA and NA polynucleotides of the invention exclude the NCR, butinclude the ORF (including fragments and variants (e.g., at least about60%, or at least 70%, or at least 80%, or at least 90%, or at leastabout 91% or at least about 92%, or at least about 93%, or at leastabout 94%, or at least about 95%, or at least about 96%, or at leastabout 97%, or at least about 98%, or at least about 98.5%, or at leastabout 98.7%, or at least about 99%, or at least about 99.1%, or at leastabout 99.2%, or at least about 99.3%, or at least about 99.4%, or atleast about 99.5%, or at least about 99.6% or at least about 99.7%, orat least about 99.8%, or at least about 99.9% thereof)) of the HA and NAsequences of pandemic virus strains (e.g., SEQ ID NOS: 1-9).

The HA and NA polynucleotides of the invention, e.g., SEQ ID NO:1through SEQ ID NO:10, SEQ ID NO:21 through SEQ ID NO:26, SEQ ID NO:33through SEQ ID NO:38, SEQ ID NO:45, and fragments thereof, areoptionally used in a number of different capacities alternative to, orin addition to, the vaccines described above. Other exemplary uses aredescribed herein for illustrative purpose and not as limitations on theactual range of uses. Different methods of construction, purification,and characterization of the nucleotide sequences of the invention arealso described herein. In some embodiments, nucleic acids including oneor more polynucleotide sequence of the invention are favorably used asprobes for the detection of corresponding or related nucleic acids in avariety of contexts, such as in nucleic hybridization experiments, e.g.,to find and/or characterize homologous influenza variants (e.g.,homologues to the sequences herein, etc.) infecting other species or indifferent influenza outbreaks, etc. The probes can be either DNA or RNAmolecules, such as restriction fragments of genomic or cloned DNA,cDNAs, PCR amplification products, transcripts, and oligonucleotides,and can vary in length from oligonucleotides as short as about 10nucleotides in length to full length sequences or cDNAs in excess of 1kb or more. For example, in some embodiments, a probe of the inventionincludes a polynucleotide sequence or subsequence selected, e.g., fromamong SEQ ID NO: 1 through SEQ ID NO: 10, SEQ ID NO:21 through SEQ IDNO:26, SEQ ID NO:33 through SEQ ID NO:38, SEQ ID NO:45, or sequencescomplementary thereto. Alternatively, polynucleotide sequences that arevariants of one of the above-designated sequences are used as probes.Most typically, such variants include one or a few conservativenucleotide variations. For example, pairs (or sets) of oligonucleotidescan be selected, in which the two (or more) polynucleotide sequences areconservative variations of each other, wherein one polynucleotidesequence corresponds identically to a first variant or and the other(s)corresponds identically to additional variants. Such pairs ofoligonucleotide probes are particularly useful, e.g., for specifichybridization experiments to detect polymorphic nucleotides or to, e.g.,detect homologous influenza HA and NA variants, e.g., homologous to thecurrent HA and NA sequences, infecting other species or present indifferent (e.g., either temporally and/or geographically different)influenza outbreaks. In other applications, probes are selected that aremore divergent, that is probes that are at least about 91% (or about92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%,about 98.5%, about 98.7%, about 99%, about 99.1%, about 99.2%, about99.3%, about 99.4%, about 99.5%, or about 99.6% or more about 99.7%,about 99.8%, about 99.9% or more) identical are selected.

The probes of the invention, e.g., as exemplified by sequences derivedfrom the sequences herein, can also be used to identify additionaluseful polynucleotide sequences according to procedures routine in theart. In one set of embodiments, one or more probes, as described above,are utilized to screen libraries of expression products or chromosomalsegments (e.g., expression libraries or genomic libraries) to identifyclones that include sequences identical to, or with significant sequencesimilarity to, e.g., one or more probe of the sequences herein, i.e.,variants, homologues, etc. It will be understood that in addition tosuch physical methods as library screening, computer assistedbioinformatic approaches, e.g., BLAST and other sequence homology searchalgorithms, and the like, can also be used for identifying relatedpolynucleotide sequences. Polynucleotide sequences identified in thismanner are also a feature of the invention.

Oligonucleotide probes are optionally produced via a variety of methodswell known to those skilled in the art. Most typically, they areproduced by well known synthetic methods, such as the solid phasephosphoramidite triester method described by Beaucage and Caruthers(1981) Tetrahedron Letts 22(20):1859-1862, e.g., using an automatedsynthesizer, or as described in Needham-Van Devanter et al. (1984) NuclAcids Res, 12:6159-6168. Oligonucleotides can also be custom made andordered from a variety of commercial sources known to persons of skill.Purification of oligonucleotides, where necessary, is typicallyperformed by either native acrylamide gel electrophoresis or byanion-exchange HPLC as described in Pearson and Regnier (1983) J Chrom255:137-149. The sequence of the synthetic oligonucleotides can beverified using the chemical degradation method of Maxam and Gilbert(1980) in Grossman and Moldave (eds.) Academic Press, New York, Methodsin Enzymology 65:499-560. Custom oligos can also easily be ordered froma variety of commercial sources known to persons of skill.

In other circumstances, e.g., relating to attributes of cells ororganisms expressing the polynucleotides and polypeptides of theinvention (e.g., those harboring virus comprising the sequences of theinvention), probes that are polypeptides, peptides or antibodies arefavorably utilized. For example, isolated or recombinant polypeptides,polypeptide fragments and peptides derived from any of the amino acidsequences of the invention (e.g., SEQ ID NO: 11-20, SEQ ID NO: 27-32,SEQ ID NO:39-44) and/or encoded by polynucleotide sequences of theinvention, e.g., selected from SEQ ID NO: 1 through SEQ ID NO: 10, SEQID NO: 21 through SEQ ID NO: 26, SEQ ID NO:33 through SEQ ID NO:38, andSEQ ID NO:45 are favorably used to identify and isolate antibodies,e.g., from phage display libraries, combinatorial libraries, polyclonalsera, and the like. Polypeptide fragments of the inventions include apeptide or polypeptide comprising an amino acid sequence of at least 5contiguous amino acid residues, or at least 10 contiguous amino acidresidues, or at least 15 contiguous amino acid residues, or at least 20contiguous amino acid residues, or at least 25 contiguous amino acidresidues, or at least 40 contiguous amino acid residues, or at least 50contiguous amino acid residues, or at least 60 contiguous aminoresidues, or at least 70 contiguous amino acid residues, or at leastcontiguous 80 amino acid residues, or at least contiguous 90 amino acidresidues, or at least contiguous 100 amino acid residues, or at leastcontiguous 125 amino acid residues, or at least 150 contiguous aminoacid residues, or at least contiguous 175 amino acid residues, or atleast contiguous 200 amino acid residues, or at least contiguous 250amino acid residues, or at least contiguous 350, or at least contiguous400, or at least contiguous 450, or at least contiguous 500, or at leastcontiguous 550 amino acid residues of the amino acid sequence an HA orNA polypeptide of the invention (e.g., SEQ ID NOS: 11-20, SEQ ID NOS:27-32, and SEQ ID NOS: 39-44). Polynucleotides encoding said polypeptidefragments and antibodies that specifically bind said polypeptides arealso preferred embodiments of the invention.

Antibodies specific for any polypeptide sequence or subsequence, e.g.,of SEQ ID NO: 11 through SEQ ID NO: 20, SEQ ID NO: 27 through SEQ ID NO:32, and/or SEQ ID NO: 39 through SEQ ID NO: 44, and/or encoded bypolynucleotide sequences of the invention, e.g., selected from SEQ IDNO: 1 through SEQ ID NO: 10, SEQ ID NO: 21 through SEQ ID NO: 26, SEQ IDNO: 33 through SEQ ID NO: 38, and SEQ ID NO:45 are likewise valuable asprobes for evaluating expression products, e.g., from cells or tissues.In addition, antibodies are particularly suitable for evaluatingexpression of proteins comprising amino acid subsequences, e.g., ofthose given herein, or encoded by polynucleotides sequences of theinvention, e.g., selected from those shown herein, in situ, in a tissuearray, in a cell, tissue or organism, e.g., an organism infected by anunidentified influenza virus or the like. Antibodies can be directlylabeled with a detectable reagent, or detected indirectly by labeling ofa secondary antibody specific for the heavy chain constant region (i.e.,isotype) of the specific antibody. Additional details regardingproduction of specific antibodies are provided below.

Diagnostic Assays

The nucleic acid sequences of the present invention can be used indiagnostic assays to detect influenza (and/or hemagglutinin and/orneuraminidase) in a sample, to detect hemagglutinin-like and/orneuraminidase-like sequences, and to detect strain differences inclinical isolates of influenza using either chemically synthesized orrecombinant polynucleotide fragments, e.g., selected from the sequencesherein. For example, fragments of the hemagglutinin and/or neuraminidasesequences comprising at least between 10 and 20 nucleotides can be usedas primers to amplify nucleic acids using polymerase chain reaction(PCR) methods well known in the art (e.g., reverse transcription-PCR)and as probes in nucleic acid hybridization assays to detect targetgenetic material such as influenza RNA in clinical specimens.

The probes of the invention, e.g., as exemplified by unique subsequencesselected from those given herein, can also be used to identifyadditional useful polynucleotide sequences (such as to characterizeadditional strains of influenza) according to procedures routine in theart. In one set of preferred embodiments, one or more probes, asdescribed above, are utilized to screen libraries of expression productsor cloned viral nucleic acids (i.e., expression libraries or genomiclibraries) to identify clones that include sequences identical to, orwith significant sequence identity to the sequences herein. In turn,each of these identified sequences can be used to make probes, includingpairs or sets of variant probes as described above. It will beunderstood that in addition to such physical methods as libraryscreening, computer assisted bioinformatic approaches, e.g., BLAST andother sequence homology search algorithms, and the like, can also beused for identifying related polynucleotide sequences.

The probes of the invention are particularly useful for detecting thepresence and for determining the identity of influenza nucleic acids incells, tissues or other biological samples (e.g., a nasal wash orbronchial lavage). For example, the probes of the invention arefavorably utilized to determine whether a biological sample, such as asubject (e.g., a human subject) or model system (such as a cultured cellsample) has been exposed to, or become infected with influenza, orparticular strain(s) of influenza. Detection of hybridization of theselected probe to nucleic acids originating in (e.g., isolated from) thebiological sample or model system is indicative of exposure to orinfection with the virus (or a related virus) from which the probepolynucleotide is selected.

It will be appreciated that probe design is influenced by the intendedapplication. For example, where several allele-specific probe-targetinteractions are to be detected in a single assay, e.g., on a single DNAchip, it is desirable to have similar melting temperatures for all ofthe probes. Accordingly, the lengths of the probes are adjusted so thatthe melting temperatures for all of the probes on the array are closelysimilar (it will be appreciated that different lengths for differentprobes may be needed to achieve a particular T_(m) where differentprobes have different GC contents). Although melting temperature is aprimary consideration in probe design, other factors are optionally usedto further adjust probe construction, such as selecting against primerself-complementarity and the like.

Vectors, Promoters and Expression Systems

The present invention includes recombinant constructs incorporating oneor more of the nucleic acid sequences described herein. Such constructsoptionally include a vector, for example, a plasmid, a cosmid, a phage,a virus, a bacterial artificial chromosome (BAC), a yeast artificialchromosome (YAC), etc., into which one or more of the polynucleotidesequences of the invention, e.g., comprising any of SEQ ID NO: 1 throughSEQ ID NO:10, SEQ ID NO:21 through SEQ ID NO:26, SEQ ID NO:33 throughSEQ ID NO:38, SEQ ID NO:45 or a subsequence thereof etc., has beeninserted, in a forward or reverse orientation. For example, the insertednucleic acid can include a viral chromosomal sequence or cDNA includingall or part of at least one of the polynucleotide sequences of theinvention. In one embodiment, the construct further comprises regulatorysequences, including, for example, a promoter, operably linked to thesequence. Large numbers of suitable vectors and promoters are known tothose of skill in the art, and are commercially available.

The polynucleotides of the present invention can be included in any oneof a variety of vectors suitable for generating sense or antisense RNA,and optionally, polypeptide (or peptide) expression products (e.g., ahemagglutinin and/or neuraminidase molecule of the invention, orfragments thereof). Such vectors include chromosomal, nonchromosomal andsynthetic DNA sequences, e.g., derivatives of SV40; bacterial plasmids;phage DNA; baculovirus; yeast plasmids; vectors derived fromcombinations of plasmids and phage DNA, viral DNA such as vaccinia,adenovirus, fowl pox virus, pseudorabies, adenovirus, adeno-associatedvirus, retroviruses and many others (e.g., pCDL). Any vector that iscapable of introducing genetic material into a cell, and, if replicationis desired, which is replicable in the relevant host can be used.

In an expression vector, the HA and/or NA polynucleotide sequence ofinterest is physically arranged in proximity and orientation to anappropriate transcription control sequence (e.g., promoter, andoptionally, one or more enhancers) to direct mRNA synthesis. That is,the polynucleotide sequence of interest is operably linked to anappropriate transcription control sequence. Examples of such promotersinclude: LTR or SV40 promoter, E. coli lac or trp promoter, phage lambdaP_(L) promoter, and other promoters known to control expression of genesin prokaryotic or eukaryotic cells or their viruses.

A variety of promoters are suitable for use in expression vectors forregulating transcription of influenza virus genome segments. In certainembodiments, the cytomegalovirus (CMV) DNA dependent RNA Polymerase II(Pol II) promoter is utilized. If desired, e.g., for regulatingconditional expression, other promoters can be substituted which induceRNA transcription under the specified conditions, or in the specifiedtissues or cells. Numerous viral and mammalian, e.g., human promotersare available, or can be isolated according to the specific applicationcontemplated. For example, alternative promoters obtained from thegenomes of animal and human viruses include such promoters as theadenovirus (such as Adenovirus 2), papilloma virus, hepatitis-B virus,polyoma virus, and Simian Virus 40 (SV40), and various retroviralpromoters. Mammalian promoters include, among many others, the actinpromoter, immunoglobulin promoters, heat-shock promoters, and the like.

Transcription is optionally increased by including an enhancer sequence.Enhancers are typically short, e.g., 10-500 bp, cis-acting DNA elementsthat act in concert with a promoter to increase transcription. Manyenhancer sequences have been isolated from mammalian genes (hemoglobin,elastase, albumin, alpha-fetoprotein, and insulin), and eukaryotic cellviruses. The enhancer can be spliced into the vector at a position 5′ or3′ to the heterologous coding sequence, but is typically inserted at asite 5′ to the promoter. Typically, the promoter, and if desired,additional transcription enhancing sequences are chosen to optimizeexpression in the host cell type into which the heterologous DNA is tobe introduced (Scharf et al. (1994) Heat stress promoters andtranscription factors Results Probl Cell Differ 20:125-62; Kriegler etal. (1990) Assembly of enhancers, promoters, and splice signals tocontrol expression of transferred genes Methods in Enzymol 185: 512-27).Optionally, the amplicon can also contain a ribosome binding site or aninternal ribosome entry site (IRES) for translation initiation.

The vectors of the invention also favorably include sequences necessaryfor the termination of transcription and for stabilizing the mRNA, suchas a polyadenylation site or a terminator sequence. Such sequences arecommonly available from the 5′ and, occasionally 3′, untranslatedregions of eukaryotic or viral DNAs or cDNAs. In one embodiment, theSV40 polyadenylation signal sequences can provide a bi-directionalpolyadenylation site that insulates transcription of (+) strand mRNAmolecules from the PolI promoter initiating replication of the (−)strand viral genome.

In addition, as described above, the expression vectors optionallyinclude one or more selectable marker genes to provide a phenotypictrait for selection of transformed host cells, in addition to genespreviously listed, markers such as dihydrofolate reductase or neomycinresistance are suitable for selection in eukaryotic cell culture.

The vector containing the appropriate nucleic acid sequence as describedabove, as well as an appropriate promoter or control sequence, can beemployed to transform a host cell permitting expression of the protein.While the vectors of the invention can be replicated in bacterial cells,most frequently it will be desirable to introduce them into mammaliancells, e.g., Vero cells, BHK cells, MDCK cell, 293 cells, COS cells, orthe like, for the purpose of expression.

As described elsewhere, the HA and NA sequences herein, in variousembodiments, can be comprised within plasmids involved in plasmid-rescuereassortment. See, e.g., U.S. Application Nos. 60/420,708, filed Oct.23, 2002; 60/574,117, filed May 24, 2004; 10/423,828, filed Apr. 25,2003; 60/578,962, filed Jun. 12, 2004; and 10/870,690 filed Jun. 16,2004; and US20020164770, which are incorporated by reference herein. Forexample, preferred expression vectors of the invention include, but arenot limited to, vectors comprising pol I promoter and terminatorsequences or vectors using both the pol I and pol II promoters “thepolI/polII promoter system” (e.g., Zobel et al., Nucl. Acids Res. 1993,21:3607; US20020164770; Neumann et al., Proc. Natl. Acad. Sci. USA 1999,96:9345; Fodor et al., J. Virol. 1999, 73:9679; and US20030035814). Thereassortants produced can include the HA and NA genes arranged with the6 other influenza genes from the A/Ann Arbor/6/60 donor strain (and/orderivatives and modifications thereof), the PR8 donor strain backbone,the A/Leningrad/17 donor strain backbone, etc. Other backbone strainsare described, for example, in 20040137013 and 20030147916, which areincorporated by reference herein.

Additional Expression Elements

Most commonly, the genome segment encoding the influenza virus HA and/orNA protein includes any additional sequences necessary for itsexpression, including translation into a functional viral protein. Inother situations, a minigene, or other artificial construct encoding theviral proteins, e.g., an HA and/or NA protein, can be employed. Again,in such case, it is often desirable to include specific initiationsignals that aid in the efficient translation of the heterologous codingsequence. These signals can include, e.g., the ATG initiation codon andadjacent sequences. To insure translation of the entire insert, theinitiation codon is inserted in the correct reading frame relative tothe viral protein. Exogenous transcriptional elements and initiationcodons can be of various origins, both natural and synthetic. Theefficiency of expression can be enhanced by the inclusion of enhancersappropriate to the cell system in use.

If desired, polynucleotide sequences encoding additional expressedelements, such as signal sequences, secretion or localization sequences,and the like can be incorporated into the vector, usually, in-frame withthe polynucleotide sequence of interest, e.g., to target polypeptideexpression to a desired cellular compartment, membrane, or organelle, orto direct polypeptide secretion to the periplasmic space or into thecell culture media. Such sequences are known to those of skill, andinclude secretion leader peptides, organelle targeting sequences (e.g.,nuclear localization sequences, ER retention signals, mitochondrialtransit sequences), membrane localization/anchor sequences (e.g., stoptransfer sequences, GPI anchor sequences), and the like.

Where translation of a polypeptide encoded by a nucleic acid sequence ofthe invention is desired, additional translation specific initiationsignals can improve the efficiency of translation. These signals caninclude, e.g., an ATG initiation codon and adjacent sequences, an IRESregion, etc. In some cases, for example, full-length cDNA molecules orchromosomal segments including a coding sequence incorporating, e.g., apolynucleotide sequence of the invention (e.g., as in the sequencesherein), a translation initiation codon and associated sequence elementsare inserted into the appropriate expression vector simultaneously withthe polynucleotide sequence of interest. In such cases, additionaltranslational control signals frequently are not required. However, incases where only a polypeptide coding sequence, or a portion thereof, isinserted, exogenous translational control signals, including, e.g., anATG initiation codon is often provided for expression of the relevantsequence. The initiation codon is put in the correct reading frame toensure transcription of the polynucleotide sequence of interest.Exogenous transcriptional elements and initiation codons can be ofvarious origins, both natural and synthetic. The efficiency ofexpression can be enhanced by the inclusion of enhancers appropriate tothe cell system in use (see, e.g., Scharf D. et al. (1994) Results ProblCell Differ 20:125-62; Bittner et al. (1987) Methods in Enzymol153:516-544).

Production of Recombinant Virus

Negative strand RNA viruses can be genetically engineered and recoveredusing a recombinant reverse genetics approach (see, e.g., U.S. Pat. No.5,166,057 to Palese et al.). Such method was originally applied toengineer influenza viral genomes (Luytjes et al. (1989) Cell59:1107-1113; Enami et al. (1990) Proc. Natl. Acad. Sci. USA92:11563-11567), and has been successfully applied to a wide variety ofsegmented and nonsegmented negative strand RNA viruses, e.g., rabies(Schnell et al. (1994) EMBO J. 13: 4195-4203); VSV (Lawson et al. (1995)Proc. Natl. Acad. Sci. USA 92: 4477-4481); measles virus (Radecke et al.(1995) EMBO J. 14:5773-5784); rinderpest virus (Baron & Barrett (1997)J. Virol. 71: 1265-1271); human parainfluenza virus (Hoffman & Banerjee(1997) J. Virol. 71: 3272-3277; Dubin et al. (1997) Virology235:323-332); SV5 (He et al. (1997) Virology 237:249-260); caninedistemper virus (Gassen et al. (2000) J. Virol. 74:10737-44); and Sendaivirus (Park et al. (1991) Proc. Natl. Acad. Sci. USA 88: 5537-5541; Katoet al. (1996) Genes to Cells 1:569-579). Those of skill in the art willbe familiar with these and similar techniques to produce influenza viruscomprising the HA and NA sequences of the invention. Recombinantinfluenza viruses produced according to such methods are also a featureof the invention, as are recombinant influenza virus comprising one ormore nucleic acids and/or polypeptides of the invention.

Cell Culture and Expression Hosts

The present invention also relates to host cells that are introduced(transduced, transformed or transfected) with vectors of the invention,and the production of polypeptides of the invention by recombinanttechniques. Host cells are genetically engineered (i.e., transduced,transformed or transfected) with a vector, such as an expression vector,of this invention. As described above, the vector can be in the form ofa plasmid, a viral particle, a phage, etc. Examples of appropriateexpression hosts include: bacterial cells, such as E. coli,Streptomyces, and Salmonella typhimurium; fungal cells, such asSaccharomyces cerevisiae, Pichia pastoris, and Neurospora crassa; orinsect cells such as Drosophila and Spodoptera frugiperda.

Most commonly, mammalian cells are used to culture the HA and NAmolecules of the invention. Suitable host cells for the replication ofinfluenza virus include, e.g., Vero cells, BHK cells, MDCK cells, 293cells and COS cells, including 293T cells, COS7 cells or the like.Commonly, co-cultures including two of the above cell lines, e.g., MDCKcells and either 293T or COS cells are employed at a ratio, e.g., of1:1, to improve replication efficiency. Typically, cells are cultured ina standard commercial culture medium, such as Dulbecco's modifiedEagle's medium supplemented with serum (e.g., 10% fetal bovine serum),or in serum free medium, under controlled humidity and CO₂ concentrationsuitable for maintaining neutral buffered pH (e.g., at pH between 7.0and 7.2). Optionally, the medium contains antibiotics to preventbacterial growth, e.g., penicillin, streptomycin, etc., and/oradditional nutrients, such as L-glutamine, sodium pyruvate,non-essential amino acids, additional supplements to promote favorablegrowth characteristics, e.g., trypsin, β-mercaptoethanol, and the like.

The engineered host cells can be cultured in conventional nutrient mediamodified as appropriate for activating promoters, selectingtransformants, or amplifying the inserted polynucleotide sequences. Theculture conditions, such as temperature, pH and the like, are typicallythose previously used with the particular host cell selected forexpression, and will be apparent to those skilled in the art and in thereferences cited herein, including, e.g., Freshney (1994) Culture ofAnimal Cells, a Manual of Basic Technique, 3^(rd) edition, Wiley-Liss,New York and the references cited therein. Other helpful referencesinclude, e.g., Paul (1975) Cell and Tissue Culture, 5^(th) ed.,Livingston, Edinburgh; Adams (1980) Laboratory Techniques inBiochemistry and Molecular Biology-Cell Culture for Biochemists, Workand Burdon (eds.) Elsevier, Amsterdam. Additional details regardingtissue culture procedures of particular interest in the production ofinfluenza virus in vitro include, e.g., Merten et al. (1996) Productionof influenza virus in cell cultures for vaccine preparation. in Cohenand Shafferman (eds.) Novel Strategies in Design and Production ofVaccines, which is incorporated herein in its entirety for all purposes.Additionally, variations in such procedures adapted to the presentinvention are readily determined through routine experimentation andwill be familiar to those skilled in the art.

Cells for production of influenza virus (e.g., having the HA and/or NAsequences of the invention) can be cultured in serum-containing or serumfree medium. In some cases, e.g., for the preparation of purifiedviruses, it is typically desirable to grow the host cells in serum freeconditions. Cells can be cultured in small scale, e.g., less than 25 mlmedium, culture tubes or flasks or in large flasks with agitation, inrotator bottles, or on microcarrier beads (e.g., DEAE-Dextranmicrocarrier beads, such as Dormacell, Pfeifer & Langen; Superbead, FlowLaboratories; styrene copolymer-tri-methylamine beads, such as Hillex,SoloHill, Ann Arbor) in flasks, bottles or reactor cultures.Microcarrier beads are small spheres (in the range of 100-200 microns indiameter) that provide a large surface area for adherent cell growth pervolume of cell culture. For example a single liter of medium can includemore than 20 million microcarrier beads providing greater than 8000square centimeters of growth surface. For commercial production ofviruses, e.g., for vaccine production, it is often desirable to culturethe cells in a bioreactor or fermenter. Bioreactors are available involumes from under 1 liter to in excess of 100 liters, e.g., Cyto3Bioreactor (Osmonics, Minnetonka, Minn.); NBS bioreactors (New BrunswickScientific, Edison, N.J.); laboratory and commercial scale bioreactorsfrom B. Braun Biotech International (B. Braun Biotech, Melsungen,Germany).

Regardless of the culture volume, in many desired aspects of the currentinvention, it is important that the cultures be maintained at anappropriate temperature, to insure efficient recovery of recombinantand/or reassortant influenza virus using temperature dependent multiplasmid systems (see, e.g., Multi-Plasmid System for the Production ofInfluenza Virus, U.S. Application No. 60/420,708, filed Oct. 23, 2002,U.S. application Ser. No. 10/423,828, filed Apr. 25, 2003, and U.S.Application No. 60/574,117, filed May 24, 2004), heating of virussolutions for filtration, etc. Typically, a regulator, e.g., athermostat, or other device for sensing and maintaining the temperatureof the cell culture system and/or other solution, is employed to insurethat the temperature is at the correct level during the appropriateperiod (e.g., virus replication, etc.).

In some embodiments herein (e.g., wherein reassorted viruses are to beproduced from segments on vectors) vectors comprising influenza genomesegments are introduced (e.g., transfected) into host cells according tomethods well known in the art for introducing heterologous nucleic acidsinto eukaryotic cells, including, e.g., calcium phosphateco-precipitation, electroporation, microinjection, lipofection, andtransfection employing polyamine transfection reagents. For example,vectors, e.g., plasmids, can be transfected into host cells, such as COScells, 293T cells or combinations of COS or 293T cells and MDCK cells,using the polyamine transfection reagent TransIT-LT 1 (Mirus) accordingto the manufacturer's instructions in order to produce reassortedviruses, etc. Thus, in one example, approximately 1 μg of each vector isintroduced into a population of host cells with approximately 2 μl ofTransIT-LT 1 diluted in 160 μl medium, preferably serum-free medium, ina total volume of 200 μl. The DNA: transfection reagent mixtures areincubated at room temperature for 45 minutes followed by addition of 800μl of medium. The transfection mixture is added to the host cells, andthe cells are cultured as described via other methods well known tothose skilled in the art. Accordingly, for the production of recombinantor reassortant viruses in cell culture, vectors incorporating each ofthe 8 genome segments, (PB2, PB1, PA, NP, M, NS, HA and NA, e.g., of theinvention) are mixed with approximately 20 μl TransIT-LT1 andtransfected into host cells. Optionally, serum-containing medium isreplaced prior to transfection with serum-free medium, e.g., Opti-MEM I,and incubated for 4-6 hours.

Alternatively, electroporation can be employed to introduce such vectorsincorporating influenza genome segments into host cells. For example,plasmid vectors incorporating an influenza A or influenza B virus arefavorably introduced into Vero cells using electroporation according tothe following procedure. In brief, approximately 5×10⁶ Vero cells, e.g.,grown in Modified Eagle's Medium (MEM) supplemented with 10% FetalBovine Serum (FBS) are resuspended in 0.4 ml OptiMEM and placed in anelectroporation cuvette. Twenty micrograms of DNA in a volume of up to25 μl is added to the cells in the cuvette, which is then mixed gentlyby tapping. Electroporation is performed according to the manufacturer'sinstructions (e.g., BioRad Gene Pulser II with Capacitance Extender Plusconnected) at 300 volts, 950 microFarads with a time constant of between28-33 msec. The cells are remixed by gently tapping and approximately1-2 minutes following electroporation 0.7 ml MEM with 10% FBS is addeddirectly to the cuvette. The cells are then transferred to two wells ofa standard 6 well tissue culture dish containing 2 ml MEM, 10% FBS. Thecuvette is washed to recover any remaining cells and the wash suspensionis divided between the two wells. Final volume is approximately 3.5 mL.The cells are then incubated under conditions permissive for viralgrowth, e.g., at approximately 33° C. for cold adapted strains.

In mammalian host cells, a number of expression systems, such asviral-based systems, can be utilized. In cases where an adenovirus isused as an expression vector, a coding sequence is optionally ligatedinto an adenovirus transcription/translation complex consisting of thelate promoter and tripartite leader sequence. Insertion in anonessential E1 or E3 region of the viral genome will result in a viablevirus capable of expressing the polypeptides of interest in infectedhost cells (Logan and Shenk (1984) Proc Natl Acad Sci 81:3655-3659). Inaddition, transcription enhancers, such as the rous sarcoma virus (RSV)enhancer, can be used to increase expression in mammalian host cells.

A host cell strain is optionally chosen for its ability to modulate theexpression of the inserted sequences or to process the expressed proteinin the desired fashion. Such modifications of the protein include, butare not limited to, acetylation, carboxylation, glycosylation,phosphorylation, lipidation and acylation. Post-translationalprocessing, which cleaves a precursor form into a mature form, of theprotein is sometimes important for correct insertion, folding and/orfunction. Additionally proper location within a host cell (e.g., on thecell surface) is also important. Different host cells such as COS, CHO,BHK, MDCK, 293, 293T, COS7, etc. have specific cellular machinery andcharacteristic mechanisms for such post-translational activities and canbe chosen to ensure the correct modification and processing of thecurrent introduced, foreign protein.

For long-term, high-yield production of recombinant proteins encoded by,or having subsequences encoded by, the polynucleotides of the invention,stable expression systems are optionally used. For example, cell lines,stably expressing a polypeptide of the invention, are transfected usingexpression vectors that contain viral origins of replication orendogenous expression elements and a selectable marker gene. Forexample, following the introduction of the vector, cells are allowed togrow for 1-2 days in an enriched media before they are switched toselective media. The purpose of the selectable marker is to conferresistance to selection, and its presence allows growth and recovery ofcells that successfully express the introduced sequences. Thus,resistant clumps of stably transformed cells, e.g., derived from singlecell type, can be proliferated using tissue culture techniquesappropriate to the cell type.

Host cells transformed with a nucleotide sequence encoding a polypeptideof the invention are optionally cultured under conditions suitable forthe expression and recovery of the encoded protein from cell culture.The cells expressing said protein can be sorted, isolated and/orpurified. The protein or fragment thereof produced by a recombinant cellcan be secreted, membrane-bound, or retained intracellularly, dependingon the sequence (e.g., depending upon fusion proteins encoding amembrane retention signal or the like) and/or the vector used.

Expression products corresponding to the nucleic acids of the inventioncan also be produced in non-animal cells such as plants, yeast, fungi,bacteria and the like. In addition to Sambrook, Berger and Ausubel, allinfra, details regarding cell culture can be found in Payne et al.(1992) Plant Cell and Tissue Culture in Liquid Systems John Wiley &Sons, Inc. New York, N.Y.; Gamborg and Phillips (eds.) (1995) Plant CellTissue and Organ Culture; Fundamental Methods Springer Lab Manual,Springer-Verlag (Berlin Heidelberg New York) and Atlas and Parks (eds.)The Handbook of Microbiological Media (1993) CRC Press, Boca Raton, Fla.

In bacterial systems, a number of expression vectors can be selecteddepending upon the use intended for the expressed product. For example,when large quantities of a polypeptide or fragments thereof are neededfor the production of antibodies, vectors that direct high-levelexpression of fusion proteins that are readily purified are favorablyemployed. Such vectors include, but are not limited to, multifunctionalE. coli cloning and expression vectors such as BLUESCRIPT (Stratagene),in which the coding sequence of interest, e.g., sequences comprisingthose found herein, etc., can be ligated into the vector in-frame withsequences for the amino-terminal translation initiating methionine andthe subsequent 7 residues of beta-galactosidase producing acatalytically active beta galactosidase fusion protein; pIN vectors (VanHeeke & Schuster (1989) J Biol Chem 264:5503-5509); pET vectors(Novagen, Madison Wis.); and the like. Similarly, in the yeastSaccharomyces cerevisiae a number of vectors containing constitutive orinducible promoters such as alpha factor, alcohol oxidase and PGH can beused for production of the desired expression products. For reviews, seeAusubel, infra, and Grant et al., (1987); Methods in Enzymology153:516-544.

Nucleic Acid Hybridization

Comparative hybridization can be used to identify nucleic acids (e.g.,SEQ ID NO: 1-10, SEQ ID NO: 21-26, SEQ ID NO:33-38, SEQ ID NO:45) of theinvention, including conservative variations of nucleic acids of theinvention. This comparative hybridization method is a preferred methodof distinguishing nucleic acids of the invention. In addition, targetnucleic acids which hybridize to the nucleic acids represented by, e.g.,those shown herein under high, ultra-high and ultra-ultra-highstringency conditions are features of the invention. Examples of suchnucleic acids include those with one or a few silent or conservativenucleic acid substitutions as compared to a given nucleic acid sequence.

A test target nucleic acid is said to specifically hybridize to a probenucleic acid when it hybridizes at least one-half as well to the probeas to the perfectly matched complementary target, i.e., with a signal tonoise ratio at least one-half as high as hybridization of the probe andtarget under conditions in which a perfectly matched probe binds to aperfectly matched complementary target with a signal to noise ratio thatis at least about 5×-10× as high as that observed for hybridization toany of the unmatched target nucleic acids.

Nucleic acids “hybridize” when they associate, typically in solution.Nucleic acids hybridize due to a variety of well-characterizedphysico-chemical forces, such as hydrogen bonding, solvent exclusion,base stacking and the like. Numerous protocols for nucleic acidhybridization are well known in the art. An extensive guide to thehybridization of nucleic acids is found in Tijssen (1993) LaboratoryTechniques in Biochemistry and Molecular Biology—Hybridization withNucleic Acid Probes part I chapter 2, “Overview of principles ofhybridization and the strategy of nucleic acid probe assays,” (Elsevier,N.Y.), as well as in Ausubel, Sambrook, and Berger and Kimmel, allbelow. Hames and Higgins (1995) Gene Probes 1 IRL Press at OxfordUniversity Press, Oxford, England, (Hames and Higgins 1) and Hames andHiggins (1995) Gene Probes 2 IRL Press at Oxford University Press,Oxford, England (Hames and Higgins 2) provide details on the synthesis,labeling, detection and quantification of DNA and RNA, includingoligonucleotides.

An example of stringent hybridization conditions for hybridization ofcomplementary nucleic acids which have more than 100 complementaryresidues on a filter in a Southern or northern blot is 50% formalin with1 mg of heparin at 42° C., with the hybridization being carried outovernight. An example of stringent wash conditions comprises a 0.2× SSCwash at 65° C. for 15 minutes (see, Sambrook, infra for a description ofSSC buffer and other nucleic acid hybridization parameters). Often thehigh stringency wash is preceded by a low stringency wash to removebackground probe signal. An example low stringency wash is 2× SSC at 40°C. for 15 minutes. In general, a signal to noise ratio of 5× (or higher)than that observed for an unrelated probe in the particularhybridization assay indicates detection of a specific hybridization.

After hybridization, unhybridized nucleic acids can be removed by aseries of washes, the stringency of which can be adjusted depending uponthe desired results. Low stringency washing conditions (e.g., usinghigher salt and lower temperature) increase sensitivity, but can producenonspecific hybridization signals and high background signals. Higherstringency conditions (e.g., using lower salt and higher temperaturethat is closer to the T_(m)) lower the background signal, typically withprimarily the specific signal remaining. See, also, Rapley, R. andWalker, J. M. eds., Molecular Biomethods Handbook (Humana Press, Inc.1998).

“Stringent hybridization wash conditions” in the context of nucleic acidhybridization experiments such as Southern and northern hybridizationsare sequence dependent, and are different under different environmentalparameters. An extensive guide to the hybridization of nucleic acids isfound in Tijssen (1993), supra, and in Hames and Higgins, 1 and 2.Stringent hybridization and wash conditions can easily be determinedempirically for any test nucleic acid. For example, in determininghighly stringent hybridization and wash conditions, the hybridizationand wash conditions are gradually increased (e.g., by increasingtemperature, decreasing salt concentration, increasing detergentconcentration and/or increasing the concentration of organic solventssuch as formalin in the hybridization or wash), until a selected set ofcriteria is met. For example, the hybridization and wash conditions aregradually increased until a probe binds to a perfectly matchedcomplementary target with a signal to noise ratio that is at least 5× ashigh as that observed for hybridization of the probe to an unmatchedtarget.

In general, a signal to noise ratio of at least 2× (or higher, e.g., atleast 5×, 10×, 20×, 50×, 100×, or more) than that observed for anunrelated probe in the particular hybridization assay indicatesdetection of a specific hybridization. Detection of at least stringenthybridization between two sequences in the context of the presentinvention indicates relatively strong structural similarity to, e.g.,the nucleic acids of the present invention provided in the sequencelistings herein.

“Very stringent” conditions are selected to be equal to the thermalmelting point (T_(m)) for a particular probe. The T_(m) is thetemperature (under defined ionic strength and pH) at which 50% of thetest sequence hybridizes to a perfectly matched probe. For the purposesof the present invention, generally, “highly stringent” hybridizationand wash conditions are selected to be about 5° C. lower than the T_(m)for the specific sequence at a defined ionic strength and pH (as notedbelow, highly stringent conditions can also be referred to incomparative terms). Target sequences that are closely related oridentical to the nucleotide sequence of interest (e.g., “probe”) can beidentified under stringent or highly stringent conditions. Lowerstringency conditions are appropriate for sequences that are lesscomplementary.

“Ultra high-stringency” hybridization and wash conditions are those inwhich the stringency of hybridization and wash conditions are increaseduntil the signal to noise ratio for binding of a probe to a perfectlymatched complementary target nucleic acid is at least 10× as high asthat observed for hybridization to any unmatched target nucleic acids. Atarget nucleic acid which hybridizes to a probe under such conditions,with a signal to noise ratio of at least one-half that of the perfectlymatched complementary target nucleic acid is said to bind to the probeunder ultra-high stringency conditions.

In determining stringent or highly stringent hybridization (or even morestringent hybridization) and wash conditions, the hybridization and washconditions are gradually increased (e.g., by increasing temperature,decreasing salt concentration, increasing detergent concentration and/orincreasing the concentration of organic solvents, such as formamide, inthe hybridization or wash), until a selected set of criteria are met.For example, the hybridization and wash conditions are graduallyincreased until a probe comprising one or more polynucleotide sequencesof the invention, e.g., sequences or unique subsequences selected fromthose given herein (e.g., SEQ ID NO: 1-10, 21-26, 33-38, SEQ ID NO:45)and/or complementary polynucleotide sequences, binds to a perfectlymatched complementary target (again, a nucleic acid comprising one ormore nucleic acid sequences or subsequences selected from those givenherein and/or complementary polynucleotide sequences thereof), with asignal to noise ratio that is at least 2× (and optionally 5×, 10×, or100× or more) as high as that observed for hybridization of the probe toan unmatched target (e.g., a polynucleotide sequence comprising one ormore sequences or subsequences selected from known influenza sequencespresent in public databases such as GenBank at the time of filing,and/or complementary polynucleotide sequences thereof), as desired.

Using the polynucleotides of the invention, or subsequences thereof,novel target nucleic acids can be obtained; such target nucleic acidsare also a feature of the invention. For example, such target nucleicacids include sequences that hybridize under stringent conditions to aunique oligonucleotide probe corresponding to any of the polynucleotidesof the invention, e.g., SEQ ID NO: 1-10, 21-26, 33-38, 45).

Similarly, even higher levels of stringency can be determined bygradually increasing the hybridization and/or wash conditions of therelevant hybridization assay. For example, those in which the stringencyof hybridization and wash conditions are increased until the signal tonoise ratio for binding of the probe to the perfectly matchedcomplementary target nucleic acid is at least 10×, 20×, 50×, 100×, or500× or more as high as that observed for hybridization to any unmatchedtarget nucleic acids. The particular signal will depend on the labelused in the relevant assay, e.g., a fluorescent label, a calorimetriclabel, a radioactive label, or the like. A target nucleic acid whichhybridizes to a probe under such conditions, with a signal to noiseratio of at least one-half that of the perfectly matched complementarytarget nucleic acid is said to bind to the probe under ultra-ultra-highstringency conditions and are also features of the invention.

Nucleic acids that do not hybridize to each other under stringentconditions are still substantially identical if the polypeptides whichthey encode are substantially identical. This occurs, e.g., when a copyof a nucleic acid is created using the maximum codon degeneracypermitted by the genetic code.

Cloning, Mutagenesis and Expression of Biomolecules of Interest

General texts which describe molecular biological techniques, which areapplicable to the present invention, such as cloning, mutation, cellculture and the like, include Berger and Kimmel, Guide to MolecularCloning Techniques Methods in Enzymology volume 152 Academic Press,Inc., San Diego, Calif. (Berger); Sambrook et al., Molecular Cloning—ALaboratory Manual (3rd Ed.), Vol. 1-3, Cold Spring Harbor Laboratory,Cold Spring Harbor, N.Y., 2000 (“Sambrook”) and Current Protocols inMolecular Biology, F. M. Ausubel et al., eds., Current Protocols, ajoint venture between Greene Publishing Associates, Inc. and John Wiley& Sons, Inc., (supplemented through 2002) (“Ausubel”)). These textsdescribe mutagenesis, the use of vectors, promoters and many otherrelevant topics related to, e.g., the generation of HA and/or NAmolecules, etc.

Various types of mutagenesis are optionally used in the presentinvention, e.g., to produce and/or isolate, e.g., novel or newlyisolated HA and/or NA molecules and/or to further modify/mutate thepolypeptides (e.g., HA and NA molecules as in SEQ ID NO: 11-20 or 27-32or 39-44) of the invention. They include but are not limited tosite-directed, random point mutagenesis, homologous recombination (DNAshuffling), mutagenesis using uracil containing templates,oligonucleotide-directed mutagenesis, phosphorothioate-modified modifiedDNA mutagenesis, mutagenesis using gapped duplex DNA or the like.Additional suitable methods include point mismatch repair, mutagenesisusing repair-deficient host strains, restriction-selection andrestriction-purification, deletion mutagenesis, mutagenesis by totalgene synthesis, double-strand break repair, and the like. Mutagenesis,e.g., involving chimeric constructs, is also included in the presentinvention. In one embodiment, mutagenesis can be guided by knowninformation of the naturally occurring molecule or altered or mutatednaturally occurring molecule, e.g., sequence, sequence comparisons,physical properties, crystal structure or the like.

The above texts and examples found herein describe these procedures aswell as the following publications (and references cited within):Sieber, et al., Nature Biotechnology, 19:456-460 (2001); Ling et al.,Approaches to DNA mutagenesis: an overview, Anal Biochem 254(2): 157-178(1997); Dale et al., Oligonucleotide-directed random mutagenesis usingthe phosphorothioate method, Methods Mol Biol 57:369-374 (1996); I. A.Lorimer, I. Pastan, Nucleic Acids Res 23, 3067-8 (1995); W. P. C.Stemmer, Nature 370, 389-91 (1994); Arnold, Protein engineering forunusual environments, Current Opinion in Biotechnology 4:450-455 (1993);Bass et al., Mutant Trp repressors with new DNA-binding specificities,Science 242:240-245 (1988); Fritz et al., Oligonucleotide-directedconstruction of mutations: a gapped duplex DNA procedure withoutenzymatic reactions in vitro, Nucl Acids Res 16: 6987-6999 (1988);Kramer et al., Improved enzymatic in vitro reactions in the gappedduplex DNA approach to oligonucleotide-directed construction ofmutations, Nucl Acids Res 16: 7207 (1988); Sakamar and Khorana, Totalsynthesis and expression of a gene for the a-subunit of bovine rod outersegment guanine nucleotide-binding protein (transducin), Nucl Acids Res14: 6361-6372 (1988); Sayers et al., Y-T Exonucleases inphosphorothioate-based oligonucleotide-directed mutagenesis, Nucl AcidsRes 16:791-802 (1988); Sayers et al., Strand specific cleavage ofphosphorothioate-containing DNA by reaction with restrictionendonucleases in the presence of ethidium bromide, (1988) Nucl Acids Res16: 803-814; Carter, Improved oligonucleotide-directed mutagenesis usingM13 vectors, Methods in Enzymol 154: 382-403 (1987); Kramer & FritzOligonucleotide-directed construction of mutations via gapped duplexDNA, Methods in Enzymol 154:350-367 (1987); Kunkel, The efficiency ofoligonucleotide directed mutagenesis, in Nucleic Acids & MolecularBiology (Eckstein, F. and Lilley, D. M. J. eds., Springer Verlag,Berlin)) (1987); Kunkel et al., Rapid and efficient site-specificmutagenesis without phenotypic selection, Methods in Enzymol 154,367-382 (1987); Zoller & Smith, Oligonucleotide-directed mutagenesis: asimple method using two oligonucleotide primers and a single-strandedDNA template, Methods in Enzymol 154:329-350 (1987); Carter,Site-directed mutagenesis, Biochem J 237:1-7 (1986); Eghtedarzadeh &Henikoff, Use of oligonucleotides to generate large deletions, NuclAcids Res 14: 5115 (1986); Mandecki, Oligonucleotide-directeddouble-strand break repair in plasmids of Escherichia coli: a method forsite-specific mutagenesis, Proc Natl Acad Sci USA, 83:7177-7181 (1986);Nakamaye & Eckstein, Inhibition of restriction endonuclease Nci Icleavage by phosphorothioate groups and its application tooligonucleotide-directed mutagenesis, Nucl Acids Res 14: 9679-9698(1986); Wells et al., Importance of hydrogen-bond formation instabilizing the transition state of subtilisin, Phil Trans R Soc Lond A317: 415-423 (1986); Botstein & Shortle, Strategies and applications ofin vitro mutagenesis, Science 229:1193-1201 (1985); Carter et al.,Improved oligonucleotide site-directed mutagenesis using M13 vectors,Nucl Acids Res 13: 4431-4443 (1985); Grundström et al.,Oligonucleotide-directed mutagenesis by microscale ‘shot-gun ’ genesynthesis, Nucl Acids Res 13:3305-3316 (1985); Kunkel, Rapid andefficient site-specific mutagenesis without phenotypic selection, ProcNatl Acad Sci USA 82:488-492 (1985); Smith, In vitro mutagenesis, AnnRev Genet. 19:423-462 (1985); Taylor et al., The use ofphosphorothioate-modified DNA in restriction enzyme reactions to preparenicked DNA, Nucl Acids Res 13: 8749-8764 (1985); Taylor et al., Therapid generation of oligonucleotide-directed mutations at high frequencyusing phosphorothioate-modified DNA, Nucl Acids Res 13: 8765-8787(1985); Wells et al., Cassette mutagenesis: an efficient method forgeneration of multiple mutations at defined sites, Gene 34:315-323(1985); Kramer et al., The gapped duplex DNA approach tooligonucleotide-directed mutation construction, Nucl Acids Res 12:9441-9456 (1984); Kramer et al., Point Mismatch Repair, Cell 38:879-887(1984); Nambiar et al., Total synthesis and cloning of a gene coding forthe ribonuclease S protein, Science 223: 1299-1301 (1984); Zoller &Smith, Oligonucleotide-directed mutagenesis of DNA fragments cloned intoM13 vectors, Methods in Enzymol 100:468-500 (1983); and Zoller & Smith,Oligonucleotide-directed mutagenesis using M3-derived vectors: anefficient and general procedure for the production of point mutations inany DNA fragment, Nucl Acids Res 10:6487-6500 (1982). Additional detailson many of the above methods can be found in Methods in Enzymol Volume154, which also describes useful controls for trouble-shooting problemswith various mutagenesis, gene isolation, expression, and other methods.

Oligonucleotides, e.g., for use in mutagenesis of the present invention,e.g., mutating libraries of the HA and/or NA molecules of the invention,or altering such, are typically synthesized chemically according to thesolid phase phosphoramidite triester method described by Beaucage andCaruthers, Tetrahedron Letts 22(20):1859-1862, (1981) e.g., using anautomated synthesizer, as described in Needham-VanDevanter et al.,Nucleic Acids Res, 12:6159-6168 (1984).

In addition, essentially any nucleic acid can be custom or standardordered from any of a variety of commercial sources, such as The MidlandCertified Reagent Company (mcrc@oligos.com), The Great American GeneCompany (www.genco.com), ExpressGen Inc. (www.expressgen.com), OperonTechnologies Inc. (Alameda, Calif.) and many others. Similarly, peptidesand antibodies can be custom ordered from any of a variety of sources,such as PeptidoGenic (available at pkim@ccnet.com), HTI Bio-products,Inc. (www.htibio.com), BMA Biomedicals Ltd. (U.K.), Bio.Synthesis, Inc.,and many others.

The present invention also relates to host cells and organismscomprising a HA and/or NA molecule or other polypeptide and/or nucleicacid of the invention, e.g., SEQ ID NOS:1-45. Host cells are geneticallyengineered (e.g., transformed, transduced or transfected) with thevectors of this invention, which can be, for example, a cloning vectoror an expression vector. The vector can be, for example, in the form ofa plasmid, a bacterium, a virus, a naked polynucleotide, or a conjugatedpolynucleotide. The vectors are introduced into cells and/ormicroorganisms by standard methods including electroporation (see, Fromet al., Proc Natl Acad Sci USA 82, 5824 (1985), infection by viralvectors, high velocity ballistic penetration by small particles with thenucleic acid either within the matrix of small beads or particles, or onthe surface (Klein et al., Nature 327, 70-73 (1987)). Berger, Sambrook,and Ausubel provide a variety of appropriate transformation methods.See, above.

Several well-known methods of introducing target nucleic acids intobacterial cells are available, any of which can be used in the presentinvention. These include: fusion of the recipient cells with bacterialprotoplasts containing the DNA, electroporation, projectile bombardment,and infection with viral vectors, etc. Bacterial cells can be used toamplify the number of plasmids containing DNA constructs of thisinvention. The bacteria are grown to log phase and the plasmids withinthe bacteria can be isolated by a variety of methods known in the art(see, for instance, Sambrook). In addition, a plethora of kits arecommercially available for the purification of plasmids from bacteria,(see, e.g., EasyPrep™, FlexiPrep™, both from Pharmacia Biotech;StrataClean™, from Stratagene; and, QIAprep™ from Qiagen). The isolatedand purified plasmids are then further manipulated to produce otherplasmids, used to transfect cells or incorporated into related vectorsto infect organisms. Typical vectors contain transcription andtranslation terminators, transcription and translation initiationsequences, and promoters useful for regulation of the expression of theparticular target nucleic acid. The vectors optionally comprise genericexpression cassettes containing at least one independent terminatorsequence, sequences permitting replication of the cassette ineukaryotes, or prokaryotes, or both, (e.g., shuttle vectors) andselection markers for both prokaryotic and eukaryotic systems. Vectorsare suitable for replication and integration in prokaryotes, eukaryotes,or optionally both. See, Giliman & Smith, Gene 8:81 (1979); Roberts, etal., Nature, 328:731 (1987); Schneider, B., et al., Protein Expr Purif6435:10 (1995); Ausubel, Sambrook, Berger (all supra). A catalogue ofBacteria and Bacteriophages useful for cloning is provided, e.g., by theATCC, e.g., The ATCC Catalogue of Bacteria and Bacteriophage (1992)Gherna et al. (eds.) published by the ATCC. Additional basic proceduresfor sequencing, cloning and other aspects of molecular biology andunderlying theoretical considerations are also found in Watson et al.(1992) Recombinant DNA Second Edition Scientific American Books, N.Y.See, above. Further vectors useful with the sequences herein areillustrated above in the section concerning production of influenzavirus for vaccines and the references cited therein.

Polypeptide Production and Recovery

Following transduction of a suitable host cell line or strain and growthof the host cells to an appropriate cell density, the selected promoteris induced by appropriate means (e.g., temperature shift or chemicalinduction) and cells are cultured for an additional period. In someembodiments, a secreted polypeptide product, e.g., a HA and/or NApolypeptide as in a secreted fusion protein form, etc., is thenrecovered from the culture medium. In other embodiments, a virusparticle containing a HA and/or a NA polypeptide of the invention isproduced from the cell. Alternatively, cells can be harvested bycentrifugation, disrupted by physical or chemical means, and theresulting crude extract retained for further purification. Eukaryotic ormicrobial cells employed in expression of proteins can be disrupted byany convenient method, including freeze-thaw cycling, sonication,mechanical disruption, or use of cell lysing agents, or other methods,which are well know to those skilled in the art. Additionally, cellsexpressing a HA and/or a NA polypeptide product of the invention can beutilized without separating the polypeptide from the cell. In suchsituations, the polypeptide of the invention is optionally expressed onthe cell surface and is examined thus (e.g., by having HA and/or NAmolecules (or fragments thereof, e.g., comprising fusion proteins or thelike) on the cell surface bind antibodies, etc. Such cells are alsofeatures of the invention.

Expressed polypeptides can be recovered and purified from recombinantcell cultures by any of a number of methods well known in the art,including ammonium sulfate or ethanol precipitation, acid extraction,anion or cation exchange chromatography, phosphocellulosechromatography, hydrophobic interaction chromatography, affinitychromatography (e.g., using any of the tagging systems known to thoseskilled in the art), hydroxylapatite chromatography, and lectinchromatography. Protein refolding steps can be used, as desired, incompleting configuration of the mature protein. Also, high performanceliquid chromatography (HPLC) can be employed in the final purificationsteps. In addition to the references noted herein, a variety ofpurification methods are well known in the art, including, e.g., thoseset forth in Sandana (1997) Bioseparation of Proteins, Academic Press,Inc.; and Bollag et al. (1996) Protein Methods, 2^(nd) EditionWiley-Liss, NY; Walker (1996) The Protein Protocols Handbook HumanaPress, NJ, Harris and Angal (1990) Protein Purification Applications: APractical Approach IRL Press at Oxford, Oxford, England; Harris andAngal Protein Purification Methods: A Practical Approach IRL Press atOxford, Oxford, England; Scopes (1993) Protein Purification: Principlesand Practice 3^(rd) Edition Springer Verlag, NY; Janson and Ryden (1998)Protein Purification: Principles High Resolution Methods andApplications Second Edition Wiley-VCH, NY; and Walker (1998) ProteinProtocols on CD-ROM Humana Press, NJ.

When the expressed polypeptides of the invention are produced inviruses, the viruses are typically recovered from the culture medium, inwhich infected (transfected) cells have been grown. Typically, crudemedium is clarified prior to concentration of influenza viruses. Commonmethods include ultrafiltration, adsorption on barium sulfate andelution, and centrifugation. For example, crude medium from infectedcultures can first be clarified by centrifugation at, e.g., 1000-2000×gfor a time sufficient to remove cell debris and other large particulatematter, e.g., between 10 and 30 minutes. Optionally, the clarifiedmedium supernatant is then centrifuged to pellet the influenza viruses,e.g., at 15,000×g, for approximately 3-5 hours. Following resuspensionof the virus pellet in an appropriate buffer, such as STE (0.01 MTris-HCl; 0.15 M NaCl; 0.0001 M EDTA) or phosphate buffered saline (PBS)at pH 7.4, the virus is concentrated by density gradient centrifugationon sucrose (60%-12%) or potassium tartrate (50%-10%). Either continuousor step gradients, e.g., a sucrose gradient between 12% and 60% in four12% steps, are suitable. The gradients are centrifuged at a speed, andfor a time, sufficient for the viruses to concentrate into a visibleband for recovery. Alternatively, and for most large-scale commercialapplications, virus is elutriated from density gradients using azonal-centrifuge rotor operating in continuous mode. Additional detailssufficient to guide one of skill through the preparation of influenzaviruses from tissue culture are provided, e.g., in Furminger. VaccineProduction, in Nicholson et al. (eds.) Textbook of Influenza pp.324-332; Merten et al. (1996) Production of influenza virus in cellcultures for vaccine preparation, in Cohen & Shafferman (eds.) NovelStrategies in Design and Production of Vaccines pp. 141-151, and U.S.Pat. No. 5,690,937. If desired, the recovered viruses can be stored at−80° C. in the presence of sucrose-phosphate-glutamate (SPG) as astabilizer

Alternatively, cell-free transcription/translation systems can beemployed to produce polypeptides comprising an amino acid sequence orsubsequence of, e.g., the sequences given herein such as SEQ ID NOS:11-20 or 27-32 or 39-44, or encoded by the polynucleotide sequences ofthe invention, e.g., SEQ ID NOS: 1-10 or 21-26 or 33-38 or 45. A numberof suitable in vitro transcription and translation systems arecommercially available. A general guide to in vitro transcription andtranslation protocols is found in Tymms (1995) In vitro Transcriptionand Translation Protocols: Methods in Molecular Biology Volume 37,Garland Publishing, NY.

In addition, the polypeptides, or subsequences thereof, e.g.,subsequences comprising antigenic peptides, can be produced manually orby using an automated system, by direct peptide synthesis usingsolid-phase techniques (see, Stewart et al. (1969) Solid-Phase PeptideSynthesis, WH Freeman Co, San Francisco; Merrifield J (1963) J Am ChemSoc 85:2149-2154). Exemplary automated systems include the AppliedBiosystems 431A Peptide Synthesizer (Perkin Elmer, Foster City, Calif.).If desired, subsequences can be chemically synthesized separately, andcombined using chemical methods to provide full-length polypeptides.

Modified Amino Acids

Expressed polypeptides of the invention can contain one or more modifiedamino acids. The presence of modified amino acids can be advantageousin, for example, (a) increasing polypeptide serum half-life, (b)reducing/increasing polypeptide antigenicity, (c) increasing polypeptidestorage stability, etc. Amino acid(s) are modified, for example,co-translationally or post-translationally during recombinant production(e.g., N-linked glycosylation at N—X—S/T motifs during expression inmammalian cells) or modified by synthetic means (e.g., via PEGylation).

Non-limiting examples of a modified amino acid include a glycosylatedamino acid, a sulfated amino acid, a prenlyated (e.g., farnesylated,geranylgeranylated) amino acid, an acetylated amino acid, an acylatedamino acid, a PEG-ylated amino acid, a biotinylated amino acid, acarboxylated amino acid, a phosphorylated amino acid, and the like, aswell as amino acids modified by conjugation to, e.g., lipid moieties orother organic derivatizing agents. References adequate to guide one ofskill in the modification of amino acids are replete throughout theliterature. Example protocols are found in Walker (1998) ProteinProtocols on CD-ROM Human Press, Towata, N.J.

Fusion Proteins

The present invention also provides fusion proteins comprising fusionsof the sequences of the invention (e.g., encoding HA and/or NApolypeptides as exampled by SEQ ID NOS: 11-20, 27-32, and 39-44) orfragments thereof with, e.g., immunoglobulins (or portions thereof),sequences encoding, e.g., GFP (green fluorescent protein), or othersimilar markers, etc. Nucleotide sequences encoding such fusion proteinsare another aspect of the invention. Fusion proteins of the inventionare optionally used for, e.g., similar applications (including, e.g.,therapeutic, prophylactic, diagnostic, experimental, etc. applicationsas described herein) as the non-fusion proteins of the invention. Inaddition to fusion with immunoglobulin sequences and marker sequences,the proteins of the invention are also optionally fused with, e.g.,sequences which allow sorting of the fusion proteins and/or targeting ofthe fusion proteins to specific cell types, regions, etc.

Antibodies

The polypeptides of the invention can be used to produce antibodiesspecific for the polypeptides given herein and/or polypeptides encodedby the polynucleotides of the invention, e.g., those shown herein, andconservative variants thereof. Antibodies specific for the abovementioned polypeptides are useful, e.g., for diagnostic and therapeuticpurposes, e.g., related to the activity, distribution, and expression oftarget polypeptides.

Antibodies specific for the polypeptides of the invention can begenerated by methods well known in the art. Such antibodies can include,but are not limited to, polyclonal, monoclonal, chimeric, humanized,single chain, Fab fragments and fragments produced by an Fab expressionlibrary.

Polypeptides do not require biological activity for antibody production(e.g., full length functional hemagglutinin or neuraminidase is notrequired). However, the polypeptide or oligopeptide must be antigenic.Peptides used to induce specific antibodies typically have an amino acidsequence of at least about 4 amino acids, and often at least 5 or 10amino acids. Short stretches of a polypeptide can be fused with anotherprotein, such as keyhole limpet hemocyanin, and antibody producedagainst the chimeric molecule.

Numerous methods for producing polyclonal and monoclonal antibodies areknown to those of skill in the art, and can be adapted to produceantibodies specific for the polypeptides of the invention, and/orencoded by the polynucleotide sequences of the invention, etc. See,e.g., Coligan (1991) Current Protocols in Immunology Wiley/Greene, NY;Paul (ed.) (1998) Fundamental Immunology Fourth Edition,Lippincott-Raven, Lippincott Williams & Wilkins; Harlow and Lane (1989)Antibodies: A Laboratory Manual Cold Spring Harbor Press, NY; Stites etal. (eds.) Basic and Clinical Immunology (4th ed.) Lange MedicalPublications, Los Altos, Calif., and references cited therein; Goding(1986) Monoclonal Antibodies: Principles and Practice (2d ed.) AcademicPress, New York, N.Y.; and Kohler and Milstein (1975) Nature 256:495-497. Other suitable techniques for antibody preparation includeselection of libraries of recombinant antibodies in phage or similarvectors. See, Huse et al. (1989) Science 246: 1275-1281; and Ward, etal. (1989) Nature 341: 544-546. Specific monoclonal and polyclonalantibodies and antisera will usually bind with a K_(D) of, e.g., atleast about 0.1 μM, at least about 0.01 μM or better, and, typically andat least about 0.001 μM or better.

For certain therapeutic applications, humanized antibodies aredesirable. Detailed methods for preparation of chimeric (humanized)antibodies can be found in U.S. Pat. No. 5,482,856. Additional detailson humanization and other antibody production and engineering techniquescan be found in Borrebaeck (ed.) (1995) Antibody Engineering, 2^(nd)Edition Freeman and Company, NY (Borrebaeck); McCafferty et al. (1996)Antibody Engineering A Practical Approach IRL at Oxford Press, Oxford,England (McCafferty), and Paul (1995) Antibody Engineering ProtocolsHumana Press, Towata, N.J. (Paul). Additional details regarding specificprocedures can be found, e.g., in Ostberg et al. (1983), Hybridoma 2:361-367, Ostberg, U.S. Pat. No. 4,634,664, and Engelman et al., U.S.Pat. No. 4,634,666.

Defining Polypeptides by Immunoreactivity

Because the polypeptides of the invention provide a variety of newpolypeptide sequences (e.g., comprising HA and NA molecules), thepolypeptides also provide new structural features which can berecognized, e.g., in immunological assays. The generation of antiserawhich specifically bind the polypeptides of the invention, as well asthe polypeptides which are bound by such antisera, are features of theinvention.

For example, the invention includes polypeptides (e.g., HA and NAmolecules) that specifically bind to or that are specificallyimmunoreactive with an antibody or antisera generated against animmunogen comprising an amino acid sequence selected from one or more ofthe sequences given herein (e.g., SEQ ID NOS: 11-20, 27-32, and 39-44),etc. To eliminate cross-reactivity with other homologues, the antibodyor antisera is subtracted with the HA and/or NA molecules found inpublic databases at the time of filing, e.g., the “control”polypeptide(s). Where the other control sequences correspond to anucleic acid, a polypeptide encoded by the nucleic acid is generated andused for antibody/antisera subtraction purposes.

In one typical format, the immunoassay uses a polyclonal antiserum whichwas raised against one or more polypeptide comprising one or more of thesequences corresponding to the sequences herein (e.g., SEQ ID NOS:11-20, 27-32, and 39-44), etc. or a substantial subsequence thereof(i.e., at least about 30% of the full length sequence provided). The setof potential polypeptide immunogens derived from the present sequencesare collectively referred to below as “the immunogenic polypeptides.”The resulting antisera is optionally selected to have lowcross-reactivity against the control hemagglutinin and/or neuraminidasehomologues and any such cross-reactivity is removed, e.g., byimmunoabsorbtion, with one or more of the control hemagglutinin andneuraminidase homologues, prior to use of the polyclonal antiserum inthe immunoassay.

In order to produce antisera for use in an immunoassay, one or more ofthe immunogenic polypeptides is produced and purified as describedherein. For example, recombinant protein can be produced in arecombinant cell. An inbred strain of mice (used in this assay becauseresults are more reproducible due to the virtual genetic identity of themice) is immunized with the immunogenic protein(s) in combination with astandard adjuvant, such as Freund's adjuvant, and a standard mouseimmunization protocol (see, e.g., Harlow and Lane (1988) Antibodies, ALaboratory Manual, Cold Spring Harbor Publications, New York, for astandard description of antibody generation, immunoassay formats andconditions that can be used to determine specific immunoreactivity).Additional references and discussion of antibodies is also found hereinand can be applied here to defining polypeptides by immunoreactivity.Alternatively, one or more synthetic or recombinant polypeptide derivedfrom the sequences disclosed herein is conjugated to a carrier proteinand used as an immunogen.

Polyclonal sera are collected and titered against the immunogenicpolypeptide in an immunoassay, for example, a solid phase immunoassaywith one or more of the immunogenic proteins immobilized on a solidsupport. Polyclonal antisera with a titer of 10⁶ or greater areselected, pooled and subtracted with the control hemagglutinin and/orneuraminidase polypeptide(s) to produce subtracted pooled titeredpolyclonal antisera.

The subtracted pooled titered polyclonal antisera are tested for crossreactivity against the control homologue(s) in a comparativeimmunoassay. In this comparative assay, discriminatory bindingconditions are determined for the subtracted titered polyclonal antiserawhich result in at least about a 5-10 fold higher signal to noise ratiofor binding of the titered polyclonal antisera to the immunogenicpolypeptides as compared to binding to the control homologues. That is,the stringency of the binding reaction is adjusted by the addition ofnon-specific competitors such as albumin or non-fat dry milk, and/or byadjusting salt conditions, temperature, and/or the like. These bindingconditions are used in subsequent assays for determining whether a testpolypeptide (a polypeptide being compared to the immunogenicpolypeptides and/or the control polypeptides) is specifically bound bythe pooled subtracted polyclonal antisera. In particular, testpolypeptides which show at least a 2-5× higher signal to noise ratiothan the control receptor homologues under discriminatory bindingconditions, and at least about a ½ signal to noise ratio as compared tothe immunogenic polypeptide(s), shares substantial structural similaritywith the immunogenic polypeptide as compared to the known receptor,etc., and is, therefore a polypeptide of the invention.

In another example, immunoassays in the competitive binding format areused for detection of a test polypeptide. For example, as noted,cross-reacting antibodies are removed from the pooled antisera mixtureby immunoabsorbtion with the control polypeptides. The immunogenicpolypeptide(s) are then immobilized to a solid support which is exposedto the subtracted pooled antisera. Test proteins are added to the assayto compete for binding to the pooled subtracted antisera. The ability ofthe test protein(s) to compete for binding to the pooled subtractedantisera as compared to the immobilized protein(s) is compared to theability of the immunogenic polypeptide(s) added to the assay to competefor binding (the immunogenic polypeptides compete effectively with theimmobilized immunogenic polypeptides for binding to the pooledantisera). The percent cross-reactivity for the test proteins iscalculated, using standard calculations.

In a parallel assay, the ability of the control protein(s) to competefor binding to the pooled subtracted antisera is optionally determinedas compared to the ability of the immunogenic polypeptide(s) to competefor binding to the antisera. Again, the percent cross-reactivity for thecontrol polypeptide(s) is calculated, using standard calculations. Wherethe percent cross-reactivity is at least 5-10× as high for the testpolypeptides as compared to the control polypeptide(s) and or where thebinding of the test polypeptides is approximately in the range of thebinding of the immunogenic polypeptides, the test polypeptides are saidto specifically bind the pooled subtracted antisera.

In general, the immunoabsorbed and pooled antisera can be used in acompetitive binding immunoassay as described herein to compare any testpolypeptide to the immunogenic and/or control polypeptide(s). In orderto make this comparison, the immunogenic, test and control polypeptidesare each assayed at a wide range of concentrations and the amount ofeach polypeptide required to inhibit 50% of the binding of thesubtracted antisera to, e.g., an immobilized control, test orimmunogenic protein is determined using standard techniques. If theamount of the test polypeptide required for binding in the competitiveassay is less than twice the amount of the immunogenic polypeptide thatis required, then the test polypeptide is said to specifically bind toan antibody generated to the immunogenic protein, provided the amount isat least about 5-10× as high as for the control polypeptide.

As an additional determination of specificity, the pooled antisera isoptionally fully immunosorbed with the immunogenic polypeptide(s)(rather than the control polypeptide(s)) until little or no binding ofthe resulting immunogenic polypeptide subtracted pooled antisera to theimmunogenic polypeptide(s) used in the immunosorbtion is detectable.This fully immunosorbed antisera is then tested for reactivity with thetest polypeptide. If little or no reactivity is observed (i.e., no morethan 2× the signal to noise ratio observed for binding of the fullyimmunosorbed antisera to the immunogenic polypeptide), then the testpolypeptide is specifically bound by the antisera elicited by theimmunogenic protein.

Nucleic Acid and Polypeptide Sequence Variants

As described herein, the invention provides for nucleic acidpolynucleotide sequences and polypeptide amino acid sequences, e.g.,hemagglutinin and neuraminidase sequences, and, e.g., compositions andmethods comprising said sequences. Examples of said sequences aredisclosed herein (e.g., SEQ ID NOS: 1-45). However, one of skill in theart will appreciate that the invention is not necessarily limited tothose sequences disclosed herein and that the present invention alsoprovides many related and unrelated sequences with the functionsdescribed herein, e.g., encoding a HA and/or a NA molecule.

One of skill will also appreciate that many variants of the disclosedsequences are included in the invention. For example, conservativevariations of the disclosed sequences that yield a functionallyidentical sequence are included in the invention. Variants of thenucleic acid polynucleotide sequences, wherein the variants hybridize toat least one disclosed sequence, are considered to be included in theinvention. Unique subsequences of the sequences disclosed herein, asdetermined by, e.g., standard sequence comparison techniques, are alsoincluded in the invention.

Silent Variations

Due to the degeneracy of the genetic code, any of a variety of nucleicacid sequences encoding polypeptides and/or viruses of the invention areoptionally produced, some which can bear lower levels of sequenceidentity to the HA and NA nucleic acid and polypeptide sequences herein.The following provides a typical codon table specifying the geneticcode, found in many biology and biochemistry texts.

TABLE 1 Codon Table Amino acids Codon Alanine Ala A GCA GCC GCG GCUCysteine Cys C UGC UGU Aspartic  Asp D GAC GAU acid Glutamic  Glu E GAAGAG acid Pheny- Phe F UUC UUU lalanine Glycine Gly G GGA GGC GGG GGUHistidine His H CAC CAU Isoleucine Ile I AUA AUC AUU Lysine Lys K AAAAAG Leucine Leu L UUA UUG CUA CUC CUG CUU Methionine Met M AUGAsparagine Asn N AAC AAU Proline Pro P CCA CCC CCG CCU Glutamine Gln QCAA CAG Arginine Arg R AGA AGG CGA CGC CGG CGU Serine Ser S AGC AGU UCAUCC UCG UCU Threonine Thr T ACA ACC ACG ACU Valine Val V GUA GUC GUG GUUTryptophan Trp W UGG Tyrosine Tyr Y UAC UAU

The codon table shows that many amino acids are encoded by more than onecodon. For example, the codons AGA, AGG, CGA, CGC, CGG, and CGU allencode the amino acid arginine. Thus, at every position in the nucleicacids of the invention where an arginine is specified by a codon, thecodon can be altered to any of the corresponding codons described abovewithout altering the encoded polypeptide. It is understood that U in anRNA sequence corresponds to T in a DNA sequence.

Such “silent variations” are one species of “conservatively modifiedvariations,” discussed below. One of skill will recognize that eachcodon in a nucleic acid (except ATG, which is ordinarily the only codonfor methionine, and TTG, which is ordinarily the only codon fortryptophan) can be modified by standard techniques to encode afunctionally identical polypeptide. Accordingly, each silent variationof a nucleic acid which encodes a polypeptide is implicit in anydescribed sequence. The invention, therefore, explicitly provides eachand every possible variation of a nucleic acid sequence encoding apolypeptide of the invention that could be made by selectingcombinations based on possible codon choices. These combinations aremade in accordance with the standard triplet genetic code (e.g., as setforth in Table 1, or as is commonly available in the art) as applied tothe nucleic acid sequence encoding a hemagglutinin or a neuraminidasepolypeptide of the invention. All such variations of every nucleic acidherein are specifically provided and described by consideration of thesequence in combination with the genetic code. One of skill is fullyable to make these silent substitutions using the methods herein.

Conservative Variations

Owing to the degeneracy of the genetic code, “silent substitutions”(i.e., substitutions in a nucleic acid sequence which do not result inan alteration in an encoded polypeptide) are an implied feature of everynucleic acid sequence of the invention which encodes an amino acid.Similarly, “conservative amino acid substitutions,” in one or a fewamino acids in an amino acid sequence are substituted with differentamino acids with highly similar properties, are also readily identifiedas being highly similar to a disclosed construct such as those herein.Such conservative variations of each disclosed sequence are a feature ofthe present invention.

“Conservative variations” of a particular nucleic acid sequence refersto those nucleic acids which encode identical or essentially identicalamino acid sequences, or, where the nucleic acid does not encode anamino acid sequence, to essentially identical sequences, see, Table 2below. One of skill will recognize that individual substitutions,deletions or additions which alter, add or delete a single amino acid ora small percentage of amino acids (typically less than 5%, moretypically less than 4%, 3%, 2% or 1%) in an encoded sequence are“conservatively modified variations” where the alterations result in thedeletion of an amino acid, addition of an amino acid, or substitution ofan amino acid with a chemically similar amino acid. Thus, “conservativevariations” of a listed polypeptide sequence of the present inventioninclude substitutions of a small percentage, typically less than 5%,more typically less than 4%, 3%, 2% or 1%, of the amino acids of thepolypeptide sequence, with a conservatively selected amino acid of thesame conservative substitution group. Finally, the addition of sequenceswhich do not alter the encoded activity of a nucleic acid molecule, suchas the addition of a non-functional sequence, is a conservativevariation of the basic nucleic acid.

TABLE 2 Conservative Substitution Groups 1 Alanine (A) Serine (S)Threonine (T) 2 Aspartic acid (D) Glutamic acid (E) 3 Asparagine (N)Glutamine (Q) 4 Arginine (R) Lysine (K) 5 Isoleucine (I) Leucine (L)Methionine (M) Valine (V) 6 Phenylalanine (F) Tyrosine (Y) Tryptophan(W)

Unique Polypeptide and Polynucleotide Subsequences

In one aspect, the invention provides a nucleic acid which comprises aunique subsequence in a nucleic acid selected from the sequence of HAand NA molecules disclosed herein, e.g., SEQ ID NOS: 1-10, 21-26, 33-38,and 45. The unique subsequence is unique as compared to a nucleic acidscorresponding to nucleic acids such as, e.g., those found in GenBank orother similar public databases at the time of filing. Alignment can beperformed using, e.g., BLAST set to default parameters. Any uniquesubsequence is useful, e.g., as a probe to identify the nucleic acids ofthe invention. See, above.

Similarly, the invention includes a polypeptide which comprises a uniquesubsequence in a polypeptide selected from the sequence of HA and NAmolecules disclosed herein, e.g., SEQ ID NOS: 11-20, 27-32, and 39-44.Here, the unique subsequence is unique as compared to a polypeptidecorresponding to, e.g., the amino acid corresponding to polynucleotidesequences found in, e.g., GenBank or other similar public databases atthe time of filing.

The invention also provides for target nucleic acids which hybridizeunder stringent conditions to a unique coding oligonucleotide whichencodes a unique subsequence in a polypeptide selected from thesequences of HA and NA molecules of the invention wherein the uniquesubsequence is unique as compared to a polypeptide corresponding to anyof the control polypeptides (sequences of, e.g., the nucleic acidscorresponding to those found in, e.g., GenBank or other similar publicdatabases at the time of filing). Unique sequences are determined asnoted above.

Sequence Comparison Identity and Homology

The terms “identical” or percent “identity,” in the context of two ormore nucleic acid or polypeptide sequences, refer to two or moresequences or subsequences that are the same or have a specifiedpercentage of amino acid residues or nucleotides that are the same, whencompared and aligned for maximum correspondence, as measured using oneof the sequence comparison algorithms described below (or otheralgorithms available to persons of skill) or by visual inspection.

The phrase “substantially identical,” in the context of two nucleicacids or polypeptides (e.g., DNAs encoding a HA or NA molecule, or theamino acid sequence of a HA or NA molecule) refers to two or moresequences or subsequences that have at least about 90%, preferably 91%,most preferably 92%, 93%, 94%, 95%, 96%, 97%, 98%, 98.5%, 99%, 99.1%,99.2%, 99.3%, 99.4%, 99.5%, 99.6%, 99.7%, 99.8%, 99.9% or morenucleotide or amino acid residue identity, when compared and aligned formaximum correspondence, as measured using a sequence comparisonalgorithm or by visual inspection. Such “substantially identical”sequences are typically considered to be “homologous,” without referenceto actual ancestry. Preferably, “substantial identity” exists over aregion of the amino acid sequences that is at least about 200 residuesin length, at least about 250 residues, at least about 300 residues, 350residues, 400 residues, 425 residues, 450 residues, 475 residues, 480residues, 490 residues, 495 residues, 499 residues, 500 residues, 502residues, 559 residues, 565 residues, or 566 residues, or over the fulllength of the two sequences to be compared.

For sequence comparison and homology determination, typically onesequence acts as a reference sequence to which test sequences arecompared. When using a sequence comparison algorithm, test and referencesequences are input into a computer, subsequence coordinates aredesignated, if necessary, and sequence algorithm program parameters aredesignated. The sequence comparison algorithm then calculates thepercent sequence identity for the test sequence(s) relative to thereference sequence, based on the designated program parameters.

Optimal alignment of sequences for comparison can be conducted, e.g., bythe local homology algorithm of Smith & Waterman, Adv Appl Math 2:482(1981), by the homology alignment algorithm of Needleman & Wunsch, J MolBiol 48:443 (1970), by the search for similarity method of Pearson &Lipman, Proc Natl Acad Sci USA 85:2444 (1988), by computerizedimplementations of algorithms such as GAP, BESTFIT, FASTA, and TFASTA inthe Wisconsin Genetics Software Package, Genetics Computer Group, 575Science Dr., Madison, Wis., or by visual inspection (see generally,Ausubel et al., supra).

One example of an algorithm that is suitable for determining percentsequence identity and sequence similarity is the BLAST algorithm, whichis described in Altschul et al., J Mol Biol 215:403-410 (1990). Softwarefor performing BLAST analyses is publicly available through the NationalCenter for Biotechnology Information (www.ncbi.nlm.nih.gov/. Thisalgorithm involves first identifying high scoring sequence pairs (HSPs)by identifying short words of length W in the query sequence, whicheither match or satisfy some positive-valued threshold score T whenaligned with a word of the same length in a database sequence. T isreferred to as the neighborhood word score threshold (see, Altschul etal., supra). These initial neighborhood word hits act as seeds forinitiating searches to find longer HSPs containing them. The word hitsare then extended in both directions along each sequence for as far asthe cumulative alignment score can be increased. Cumulative scores arecalculated using, for nucleotide sequences, the parameters M (rewardscore for a pair of matching residues; always >0) and N (penalty scorefor mismatching residues; always <0). For amino acid sequences, ascoring matrix is used to calculate the cumulative score. Extension ofthe word hits in each direction are halted when: the cumulativealignment score falls off by the quantity X from its maximum achievedvalue; the cumulative score goes to zero or below, due to theaccumulation of one or more negative-scoring residue alignments; or theend of either sequence is reached. The BLAST algorithm parameters W, T,and X determine the sensitivity and speed of the alignment. The BLASTNprogram (for nucleotide sequences) uses as defaults a wordlength (W) of11, an expectation (E) of 10, a cutoff of 100, M=5, N=−4, and acomparison of both strands. For amino acid sequences, the BLASTP programuses as defaults a wordlength (W) of 3, an expectation (E) of 10, andthe BLOSUM62 scoring matrix (see, Henikoff & Henikoff (1989) Proc NatlAcad Sci USA 89:10915).

In addition to calculating percent sequence identity, the BLASTalgorithm also performs a statistical analysis of the similarity betweentwo sequences (see, e.g., Karlin & Altschul, Proc Natl Acad Sci USA90:5873-5787 (1993)). One measure of similarity provided by the BLASTalgorithm is the smallest sum probability (P(N)), which provides anindication of the probability by which a match between two nucleotide oramino acid sequences would occur by chance. For example, a nucleic acidis considered similar to a reference sequence if the smallest sumprobability in a comparison of the test nucleic acid to the referencenucleic acid is less than about 0.1, more preferably less than about0.01, and most preferably less than about 0.001.

Another example of a useful sequence alignment algorithm is PILEUP.PILEUP creates a multiple sequence alignment from a group of relatedsequences using progressive, pairwise alignments. It can also plot atree showing the clustering relationships used to create the alignment.PILEUP uses a simplification of the progressive alignment method of Feng& Doolittle (1987) J. Mol. Evol. 35:351-360. The method used is similarto the method described by Higgins & Sharp (1989) CABIOS5:151-153. Theprogram can align, e.g., up to 300 sequences of a maximum length of5,000 letters. The multiple alignment procedure begins with the pairwisealignment of the two most similar sequences, producing a cluster of twoaligned sequences. This cluster can then be aligned to the next mostrelated sequence or cluster of aligned sequences. Two clusters ofsequences can be aligned by a simple extension of the pairwise alignmentof two individual sequences. The final alignment is achieved by a seriesof progressive, pairwise alignments. The program can also be used toplot a dendogram or tree representation of clustering relationships. Theprogram is run by designating specific sequences and their amino acid ornucleotide coordinates for regions of sequence comparison.

An additional example of an algorithm that is suitable for multiple DNA,or amino acid, sequence alignments is the CLUSTALW program (Thompson, J.D. et al. (1994) Nucl. Acids. Res. 22: 4673-4680). CLUSTALW performsmultiple pairwise comparisons between groups of sequences and assemblesthem into a multiple alignment based on homology. Gap open and Gapextension penalties can be, e.g., 10 and 0.05 respectively. For aminoacid alignments, the BLOSUM algorithm can be used as a protein weightmatrix. See, e.g., Henikoff and Henikoff (1992) Proc. Natl. Acad. Sci.USA 89: 10915-10919.

Digital Systems

The present invention provides digital systems, e.g., computers,computer readable media and integrated systems comprising characterstrings corresponding to the sequence information herein for the nucleicacids and isolated or recombinant polypeptides herein, including, e.g.,the sequences shown herein, and the various silent substitutions andconservative substitutions thereof. Integrated systems can furtherinclude, e.g., gene synthesis equipment for making genes correspondingto the character strings.

Various methods known in the art can be used to detect homology orsimilarity between different character strings (see, above), or can beused to perform other desirable functions such as to control outputfiles, provide the basis for making presentations of informationincluding the sequences and the like. Examples include BLAST, discussedsupra. Computer systems of the invention can include such programs,e.g., in conjunction with one or more data file or data base comprisinga sequence as noted herein.

Thus, different types of homology and similarity of various stringencyand length between various HA or NA sequences or fragments, etc. can bedetected and recognized in the integrated systems herein. For example,many homology determination methods have been designed for comparativeanalysis of sequences of biopolymers, for spell-checking in wordprocessing, and for data retrieval from various databases. With anunderstanding of double-helix pair-wise complement interactions among 4principal nucleobases in natural polynucleotides, models that simulateannealing of complementary homologous polynucleotide strings can also beused as a foundation of sequence alignment or other operations typicallyperformed on the character strings corresponding to the sequences herein(e.g., word-processing manipulations, construction of figures comprisingsequence or subsequence character strings, output tables, etc.).

Thus, standard desktop applications such as word processing software(e.g., Microsoft Word™ or Corel WordPerfect™) and database software(e.g., spreadsheet software such as Microsoft Excel™, Corel QuattroPro™, or database programs such as Microsoft Access™, Paradox™,GeneWorks™, or MacVector™ or other similar programs) can be adapted tothe present invention by inputting a character string corresponding toone or more polynucleotides and polypeptides of the invention (eithernucleic acids or proteins, or both). For example, a system of theinvention can include the foregoing software having the appropriatecharacter string information, e.g., used in conjunction with a userinterface (e.g., a GUI in a standard operating system such as a Windows,Macintosh or LINUX system) to manipulate strings of characterscorresponding to the sequences herein. As noted, specialized alignmentprograms such as BLAST can also be incorporated into the systems of theinvention for alignment of nucleic acids or proteins (or correspondingcharacter strings).

Systems in the present invention typically include a digital computerwith data sets entered into the software system comprising any of thesequences herein. The computer can be, e.g., a PC (Intel x86 or Pentiumchip-compatible DOS™, OS2™ WINDOWS™ WINDOWSNT™, WINDOWS95™,WINDOWS2000™, WINDOWS98™, LINUX based machine, a MACINTOSH™, Power PC,or a UNIX based (e.g., SUN™ work station) machine) or other commerciallyavailable computer that is known to one of skill. Software for aligningor otherwise manipulating sequences is available, or can easily beconstructed by one of skill using a standard programming language suchas Visualbasic, PERL, Fortran, Basic, Java, or the like.

Any controller or computer optionally includes a monitor which is oftena cathode ray tube (“CRT”) display, a flat panel display (e.g., activematrix liquid crystal display, liquid crystal display), or others.Computer circuitry is often placed in a box which includes numerousintegrated circuit chips, such as a microprocessor, memory, interfacecircuits, and others. The box also optionally includes a hard diskdrive, a floppy disk drive, a high capacity removable drive such as awriteable CD-ROM, and other common peripheral elements. Inputtingdevices such as a keyboard or mouse optionally provide for input from auser and for user selection of sequences to be compared or otherwisemanipulated in the relevant computer system.

The computer typically includes appropriate software for receiving userinstructions, either in the form of user input into a set parameterfields, e.g., in a GUI, or in the form of preprogrammed instructions,e.g., preprogrammed for a variety of different specific operations. Thesoftware then converts these instructions to appropriate language forinstructing the operation, e.g., of appropriate mechanisms or transportcontrollers to carry out the desired operation. The software can alsoinclude output elements for controlling nucleic acid synthesis (e.g.,based upon a sequence or an alignment of sequences herein), comparisonsof samples for differential gene expression, or other operations.

Kits and Reagents

The present invention is optionally provided to a user as a kit. Forexample, a kit of the invention contains one or more nucleic acid,polypeptide, antibody, or cell line described herein (e.g., comprising,or with, a HA and/or NA molecule of the invention). The kit can containa diagnostic nucleic acid or polypeptide, e.g., antibody, probe set,e.g., as a cDNA micro-array packaged in a suitable container, or othernucleic acid such as one or more expression vector. The kit can alsofurther comprise, one or more additional reagents, e.g., substrates,labels, primers, for labeling expression products, tubes and/or otheraccessories, reagents for collecting samples, buffers, hybridizationchambers, cover slips, etc. The kit optionally further comprises aninstruction set or user manual detailing preferred methods of using thekit components for discovery or application of diagnostic sets, etc.

When used according to the instructions, the kit can be used, e.g., forevaluating a disease state or condition, for evaluating effects of apharmaceutical agent or other treatment intervention on progression of adisease state or condition in a cell or organism, or for use as avaccine, etc.

In an additional aspect, the present invention provides system kitsembodying the methods, composition, systems and apparatus herein. Systemkits of the invention optionally comprise one or more of the following:(1) an apparatus, system, system component or apparatus component; (2)instructions for practicing methods described herein, and/or foroperating the apparatus or apparatus components herein and/or for usingthe compositions herein. In a further aspect, the present inventionprovides for the use of any apparatus, apparatus component, compositionor kit herein, for the practice of any method or assay herein, and/orfor the use of any apparatus or kit to practice any assay or methodherein.

Additionally, the kits can include one or more translation system asnoted above (e.g., a cell) with appropriate packaging material,containers for holding the components of the kit, instructionalmaterials for practicing the methods herein and/or the like. Similarly,products of the translation systems (e.g., proteins such as HA and/or NAmolecules) can be provided in kit form, e.g., with containers forholding the components of the kit, instructional materials forpracticing the methods herein and/or the like.

To facilitate use of the methods and compositions of the invention, anyof the vaccine components and/or compositions, e.g., reasserted virus inallantoic fluid, etc., and additional components, such as, buffer,cells, culture medium, useful for packaging and infection of influenzaviruses for experimental or therapeutic vaccine purposes, can bepackaged in the form of a kit. Typically, the kit contains, in additionto the above components, additional materials which can include, e.g.,instructions for performing the methods of the invention, packagingmaterial, and a container.

EXAMPLES Example 1 Construction and Analysis of H5N1 ca Viruses andVaccines

Various sequences herein comprising H5N1 HA/NA sequences were used tocreate influenza viruses and vaccines. The HA sequences in such vaccineswere altered from wild-type by removal of the polybasic cleavage sitewithin the HA. The HA/NA sequences were reasserted (in a 6:2reassortment) with ca A/AA/6/60 (a ts, att, ca virus, see above).

Three strains of H5N1 influenza were used in this example:A/VN/1203/2004, A/HK/491/1997, and A/HK/213/2003. Such strains are alsoreferred to within this example as the '97, '03, and '04 strains basedon their year designations. The HA sequence homology of these threestrains is 95-96%. FIG. 1 illustrates modification of the polybasiccleavage site of an exemplary HA sequence, the '04 HA sequences, used toconstruct the viruses/vaccines. As stated previously, variousembodiments of the invention comprise sequences which have differingregions of the polybasic cleavage site removed. See above.

As stated, the modified H5N1 sequences (i.e., the modified '97, '03, and'04 genes) were used to construct 6:2 reassortant viruses with caA/AA/6/60. It will be appreciated, and is pointed out elsewhere herein,that other desirable backbones could also have been used (e.g., PR8,etc.).

In the 6:2 reassortants of this example, the HA and NA gene sequenceswere derived from one or more wild type parent virus, i.e., the HA andNA gene sequences of the '03 virus were derived from A/HK/213/2003, theHA and NA gene sequences of the '04 virus were derived from A %VN/1203/2004, and the HA gene sequence of the '97 virus was derived fromA/HK/491/1997 while the NA gene sequence was derived from A/HK/486/1997.The remaining genes of the 6:2 reassortants were characterized bysequence analysis as derived from the A/AA/6/60 ca parent virus. Thereasserted viruses replicated to 8.0-8.5 log₁₀TCID₅₀ in eggs. However,it will be appreciated that other embodiments wherein the log₁₀TCID₅₀comprises from about 7.0 to about 9.0, from about 7.5-8.5, or from about8.0-8.5 are also within the scope of the invention. The cleavability ofthe modified HA in the constructed viruses by endogenous proteases wasrestricted in vitro and the viruses were dependent on trypsin (e.g.,from about 0.1 ug/ml to about 1.0 ug/ml) for growth. The constructedviruses were temperature sensitive as assayed by an in vitro assay.

The H5N1 ca reassortant viruses (having the modified '97, '03, or '04 HAgenes) were not lethal for chickens. For example, when 4-week-old SPFwhite Plymouth Rock chickens were inoculated intravenously with a 1:10dilution of stock virus (10^(8-8.75) TCID₅₀/ml) and observed for 10days, it was observed that 8 out of 8 chickens died within 1-2 days whenwild-type '97, '03, and '04 H5N1 were used, while 0 of 8 chickens diedwhen the H5N1 ca reassortant viruses were used. As can be seen in FIG.2, the intranasally administered H5N1 ca reassortant viruses did notreplicate in chickens.

The H5N1/AA ca reassortants were also not lethal for mice. See FIG. 3,which also shows the TCID₅₀ for the H5N1 wild-type strains. FIG. 4 showsthat the 1997 and 2004 H5N1 ca reassortant viruses were restricted inreplication in mice. FIG. 5, shows that the H5N1 ca reassorted virusesare restricted in replication in lungs of mice.

A comparison of the serum HAI antibody titers elicited in mice followinga single intranasal dose of vaccine (2003 ca as compared against 2003wild-type), is shown in FIG. 6. FIGS. 7 and 15 show similarmeasurements, but using serum neutralizing antibody titers.

FIG. 8 displays that the H5N1 ca reassortant viruses protect mice fromlethal challenge with 50, 500, or 5,000 LD₅₀ of wild-type H5N1 virus.FIG. 9 shows the efficacy of protection from pulmonary replication ofhomologous and heterologous H5N1 challenge viruses in mice. As can beseen, the ca reassortants replicated less well than the wild-typeviruses did. FIG. 10 shows related data using upper respiratory tractsof mice. Those of skill in the art will be familiar with homologous andheterologous challenges (e.g., testing whether 2003 vaccine protectsagainst a 2003 wild-type challenge (homologous) or whether a 2003vaccine protects against a 1997 wild-type challenge (heterologous),etc.).

FIG. 11 shows efficacy of protection conferred by 2004 H5N1 ca vaccineagainst high dose (10⁵TCID₅₀) challenge with homologous or heterologousH5N1 wild-type viruses in mice. FIG. 12 shows efficacy of protectionconferred by 1997 and 2003 H5N1 ca vaccines against high dose(10⁵TCID₅₀) challenge with homologous or heterologous H5N1 wild-typeviruses in mice. FIG. 13 shows efficacy of protection conferred by 2004H5N1 ca vaccine against low or high doses of homologous H5N1 wild-typevirus challenge in mice. FIGS. 11-13 demonstrate that the testedvaccines could protect against other related viruses.

In healthy human adults nasal spray administration the '04 vaccine waswell tolerated and its replication was highly restricted. See FIG. 27for replication restriction of the vaccine in healthy adults. HIantibody responses to 10^(6.7) TCID₅₀ of the '04 vaccine were alsoobserved in some of the healthy adults. See FIG. 28.

The current example demonstrates several points concerning exemplaryH5N1 ca reassortant viruses/vaccines of the invention. The modified careassortant '97, '03, and '04 viruses were shown to have in vitro tsphenotype, loss of pathogenicity in chickens and attenuation in mice. Itis expected that attenuation is also present in ferrets. Efficacy ofprotection and cross-protection against lethal challenge and systemicspread with wild-type viruses in mice was also shown. Efficacy ofprotection and cross-protections against replication of wild-typechallenge viruses in the respiratory tract of mice is also expected.

It is contemplated to use these (and similar) viruses/vaccines todetermine whether immunogenicity and efficacy is improved following 2doses of vaccine; to assess immunogenicity in non-human primates; toassess attenuation and vaccine efficacy in ferrets; to determine thecontribution of humoral and cellular immunity to observed efficacy ofthe produced vaccines in mice; to determine which residues of the 2003HA contribute to enhanced immunogenicity and introduce them into 1997and 2004 HAs; and to determine the effects of deleting the multibasicamino acid cleavage site and of the gene constellation.

Example 2 Construction and Analysis of H6 ca Viruses and Vaccines

A set of three recombinant influenza viruses and vaccines comprising H6HA sequences were prepared: (a) A/Duck, which comprised the H6 HA and N9NA of A/Duck77; (b) A/Teal, which comprised the H6 HA and N1 NA ofA/Teal97; and (c) A/Mallard, which comprised the H6 HA and N2 NA ofA/Mallard85. The six internal genome segments of each recombinant viruswere those of ca A/AA/6/60.

Each of the A/Duck, A/Teal, and A/Mallard recombinant viruses wasattenuated in nasal turbinates and lungs of ferrets. Ferrets wereintranasally inoculated with 10⁷ TCID₅₀ recombinant (ca; see paragraphimmediately above) or wild-type (wt) H6 influenza virus. Nasal turbinateand lung tissue was harvested from the ferrets three days post-infectionfor examination. FIG. 16 shows that the nasal turbinate and lung tissueof ferrets inoculated with recombinant virus (ca) exhibited lower virustiters than did the nasal turbinate and lung tissue of ferretsinoculated with the respective counterpart wt virus.

Each of the A/Duck, A/Teal, and A/Mallard recombinant (ca) viruses wasalso immunogenic in the ferrets. See FIG. 17.

FIG. 18 shows the efficacy of protection conferred by the A/Duck,A/Teal, and A/Mallard vaccines. Ferrets were vaccinated with a singledose of 7 log₁₀ PFU recombinant A/Duck, A/Teal, or A/Mallard vaccine.The ferrets were then challenged with 7 log₁₀ PFU wt A/Duck, A/Teal, orA/Mallard virus. Three days post challenge lungs and nasal turbinates ofthe ferrets were harvested and virus titer in the tissues wasdetermined. FIG. 18 shows efficacy of protection conferred by therecombinant (ca) H6 vaccines against homologous and heterologouswild-type H6 viruses in ferrets.

Example 3 Construction and Analysis of an H7N3 BC 04 ca, Virus andVaccine

A further recombinant influenza virus and vaccine was prepared using theHA H7 and NA N3 sequences of A/ck/BC/CN-6/04 (BC 04 ca). These HA and NAsequences were combined with the six internal genome segments of caA/AA/6/60.

The BC 04 ca vaccine was attenuated in the ferrets. Ferrets wereintranasally inoculated with 10⁷ TCID₅₀ vaccine in 0.5 mL. Three daysfollowing inoculation, ferret nasal turbinates, lungs, brain, andolfactory bulb were harvested. Virus titer in each of these tissues wasdiminished in ferrets inoculated with the vaccine virus relative toferrets inoculated with wt viruses A/BC/CN-6/04 or A/BC/CN-7/04. SeeFIG. 19.

The BC 04 ca vaccine was immunogenic in mice. In mice receiving the BC04 ca vaccine, neutralizing antibodies were detected at 4 weeks andthese titers rose over 8 weeks. A second dose of vaccine boostedantibody titer but final titer achieved was similar to that following asingle dose. See FIG. 20.

FIGS. 21 and 22 show the efficacy of protection conferred by the BC 04ca vaccine against both homologous and heterologous H7 wt viruses. ForFIG. 21, mice were intranasally inoculated with 1 dose vaccine fourweeks before challenge, 1 dose vaccine 8 weeks before challenge, or 2doses vaccine (administered 4 weeks apart) before lethal challenge with50 LD₅₀ homologous (A/ck/BC/CN-7/04) and heterologous (A/NL/219/03 orA/tk/Eng/63) H7 wt viruses. Weight change of the mice following lethalchallenged was monitored each day for fourteen days, to monitormorbidity associated with the wt influenza virus challenge.

For each of the mice lethally challenged with the homologousA/ck/BC/CN-7/04 virus little or no weight change was observed regardlessof whether 1 dose of vaccine was administered 4 weeks prior to challenge(a), 1 dose of vaccine was administered 8 weeks prior to challenge (b)or 2 doses of vaccine were administered prior to challenge (c).Likewise, little to no weight loss occurred following challenge of themice with either heterologous influenza virus, A/NL/2109/03 (d, e, f) orA/tk/Eng/63 (g, h, i). Again, the lack of weight loss was observedregardless of whether 1 dose of vaccine was administered 4 weeks priorto challenge (d or g), 1 dose of vaccine was administered 8 weeks priorto challenge (e, or h), or 2 doses of vaccine were administered prior tochallenge (f or i).

FIG. 22 provides further evidence of the efficacy of the H7N3 BC 04 cavaccine. In both nasal turbinates (a) and lungs (b) of mice receivingthe H7N3 BC 04 ca vaccine, protection was observed against challengeusing ck/BC/CN-6/04 (H7N3), ck/BC/CN-7/04 (H7N3), NL/219/03 (H7N7),tk/Eng/63 (H7N3), tk/UT/95 (H7N3), and tk/VA/02 (H7N2) viruses.

Example 4 Construction and Analysis of an H9N2 G9/AA ca, Virus andVaccine

A further recombinant influenza virus and vaccine was prepared using theHA H9 and NA N2 sequences of A/ck/Hong Kong/G9/97 (G9/AA ca). These HAand NA sequences were combined with the six internal genome segments ofca A/AA/6/60.

The H9N2 G9/AA ca vaccine was attenuated in the ferrets. See FIG. 23,which shows reduced virus titers in nasal turbinates (a) and lungs (b)of ferrets following administration of the H9N2 G9/AA ca virus relativeto the H9N2 G9 wt virus.

FIG. 24 provides evidence of the efficacy of the H9N2 G9 ca vaccine inmice. In the mice receiving the H9N2 G9 ca vaccine, protection wasobserved against challenge using H9N2 G9 wt and A/HK/1073/99 viruses.

The H9N2 G9/AA ca vaccine was also well tolerated in healthy adults in aclinical trial setting. Healthy adults were administered the H9N2 G9/AAca vaccine by nose drop. In the healthy adults, the H9N2 G9/AA cavaccine was highly restricted in replication. See FIG. 25. Furthermore,administration of 10^(7.0) TCID₅₀ of H9N2 G9/AA ca vaccine induced≧4-fold increases in HI titer in 92% of healthy volunteers. See FIG. 26.

While the foregoing invention has been described in some detail forpurposes of clarity and understanding, it will be clear to one skilledin the art from a reading of this disclosure that various changes inform and detail can be made without departing from the true scope of theinvention. For example, all the techniques and apparatus described abovemay be used in various combinations. All publications, patents, patentapplications, or other documents cited in this application areincorporated by reference in their entirety for all purposes to the sameextent as if each individual publication, patent, patent application, orother document were individually indicated to be incorporated byreference for all purposes. In particular, U.S. provisional applicationNos. 60/821,832 filed Aug. 9, 2006 and 60/942,804, filed Jun. 8, 2007,are incorporated herein in their entirety for all purposes.

SPECIFIC EMBODIMENTS

Additional embodiments of the present invention are presented in Table 3and 4.

TABLE 3 Specific embodiments 1 An isolated polypeptide, wherein saidpolypeptide is selected from the group consisting of: a) a polypeptideencoded by a polynucleotide sequence as shown in any one of SEQ ID NOS:21-26 or 33-38 or 45; b) a polypeptide as shown in any one of SEQ IDNOS: 27-32 or 39-44; c) the mature form of the polypeptide as shown inany one of SEQ ID NOS: 27-32 or 39-44; d) a polypeptide encoded by apolynucleotide sequence which hybridizes under highly stringentconditions to a polynucleotide sequence encoding (a) (b) or (c); and e)a polypeptide having at least 90% sequence identity to the polypeptideof (b). 2 An immunogenic composition comprising an immunologicallyeffective amount of at least one polypeptide of embodiment 1. 3 Anisolated antibody that specifically binds the polypeptide ofembodiment 1. 4 A method for stimulating the immune system of anindividual to produce a protective immune response against influenzavirus, the method comprising administering to the individual animmunologically effective amount of the polypeptide of embodiment 1 in aphysiologically acceptable carrier. 5 A recombinant influenza viruscomprising the polypeptide of embodiment 1. 6 An immunogenic compositioncomprising an immunologically effective amount of the recombinantinfluenza virus of embodiment 5. 7 A method for stimulating the immunesystem of an individual to produce a protective immune response againstinfluenza virus, the method comprising administering to the individualan immunologically effective amount of the recombinant influenza virusof embodiment 5 in a physiologically acceptable carrier. 8 An isolatednucleic acid, wherein said nucleic acid is selected from the groupconsisting of: a) a polynucleotide sequence as shown in any one of SEQID NOS: 21-26 or 33-38 or 45, or a complementary sequence thereof; b) apolynucleotide sequence encoding a polypeptide as shown in any one ofSEQ ID NOS: 27-32 or 39-44, or a complementary polynucleotide sequencethereof; c) a polynucleotide sequence which hybridizes under highlystringent conditions over substantially the entire length ofpolynucleotide sequence (a); and d) a polynucleotide sequence having atleast 98% sequence identity to the polynucleotide sequence of (a). 9 Animmunogenic composition comprising at least one of the nucleic acids ofembodiment 8. 10 A cell comprising at least one nucleic acid ofembodiment 8. 11 A vector comprising the nucleic acid of embodiment 8.12 The vector of embodiment 12, wherein the vector is a plasmid, acosmid, a phage, a virus, or a fragment of a virus. 13 The vector ofembodiment 12, wherein the vector is an expression vector. 14 A cellcomprising the vector of embodiment 13. 15 An influenza virus comprisingone or more nucleic acids of embodiment 8. 16 The virus of embodiment15, wherein the virus is a reassortment virus. 17 A 6:2 reassortmentinfluenza virus, wherein said virus comprises 6 gene encoding regionsfrom A/Ann Arbor/6/60 and 2 gene encoding regions that encodepolypeptides selected from the group consisting of: the polypeptides ofSEQ ID NOS: 27-32, and 39-44. 18 A method of producing a recombinantinfluenza virus, the method comprising: culturing the cell of embodiment14 in a suitable culture medium under conditions permitting expressionof nucleic acid; and, isolating the recombinant influenza virus from acell population comprising said cell or the medium. 19 An immunogeniccomposition comprising an immunologically effective amount of therecombinant influenza virus of embodiment 17. 20 A method forstimulating the immune system of an individual to produce a protectiveimmune response against influenza virus, the method comprisingadministering to the individual an immunologically effective amount ofthe recombinant influenza virus of embodiment 17 in a physiologicallyeffective carrier. 21 A method of producing an isolated or recombinantpolypeptide, the method comprising: culturing the host cell ofembodiment 10 in a suitable culture medium under conditions permittingexpression of said nucleic acid; and, isolating the polypeptide from oneor more of the host cells or the medium. 22 A method of prophylactic ortherapeutic treatment of a viral infection in a subject, the methodcomprising: administering to the subject, a virus of embodiment 17 in anamount effective to produce an immunogenic response against the viralinfection. 23 The method of embodiment 22, wherein the subject is ahuman. 24 The immunogenic composition of embodiment 19, wherein thehemagglutinin comprises a modified polybasic cleavage site. 25 A liveattenuated influenza vaccine comprising the composition of embodiment19. 26 A split virus or killed virus vaccine comprising the compositionof embodiment 19. 27 A live attenuated influenza vaccine comprising thecomposition of embodiment 24. 28 A split virus or killed virus vaccinecomprising the composition of embodiment 24. 29 A method for producinginfluenza viruses in cell culture, the method comprising: i) introducinginto a population of host cells, which population of host cells iscapable of supporting replication of influenza virus, a plurality ofvectors comprising nucleic acid encoding at least 6 internal genomesegments of a first influenza strain, wherein the first influenza strainis A/Ann Arbor/6/60; and, at least one genome segment encoding animmunogenic influenza surface antigen of a second influenza strain,wherein said second strain is a pandemic influenza strain, ii) culturingthe population of host cells at a temperature less than or equal to 35°C.; and, iii) recovering a plurality of influenza viruses. 30 The methodof embodiment 29, wherein the plurality of vectors comprise at least oneisolated nucleic acid, wherein said nucleic acid is selected from thegroup consisting of: a) a polynucleotide sequence of one of SEQ ID NOS:21-26 or 33-38, or 45, or a complementary sequence thereof; b) apolynucleotide sequence encoding a polypeptide of one of SEQ ID NOS:27-32 or 39-44, or a complementary polynucleotide sequence thereof; c) apolynucleotide sequence which hybridizes under highly stringentconditions over substantially the entire length of polynucleotidesequence (a); and d) a polynucleotide sequence having at least 98%sequence identity to the polynucleotide sequence of (a). 31 Animmunogenic composition comprising an immunologically effective amountof the influenza virus of embodiment 29. 32 An immunogenic compositioncomprising an immunologically effective amount of the influenza virus ofembodiment 30. 33 A method for stimulating the immune system of anindividual to produce a protective immune response against influenzavirus, the method comprising administering to the individual animmunologically effective amount of the influenza virus of embodiment 29in a physiologically effective carrier. 34 A method for stimulating theimmune system of an individual to produce a protective immune responseagainst influenza virus, the method comprising administering to theindividual an immunologically effective amount of the influenza virus ofembodiment 30 in a physiologically effective carrier. 35 A method forstimulating the immune system of an individual to produce a protectiveimmune response against influenza virus, the method comprisingadministering to the individual the immunogenic composition ofembodiment 31. 36 A method for stimulating the immune system of anindividual to produce a protective immune response against influenzavirus, the method comprising administering to the individual theimmunogenic composition of embodiment 32. 37 A live attenuated influenzavaccine comprising the immunogenic composition of embodiment 31. 38 Asplit virus or killed virus vaccine comprising the immunogeniccomposition of embodiment 32.

TABLE 4 Specific embodiments. 1 An isolated polypeptide, wherein saidpolypeptide is selected from the group consisting of: a) a polypeptidecomprising the amino acid sequence encoded by the nucleotide sequence asshown in any one of SEQ ID NOS: 21-26 or 33-38 or 45; b) a polypeptidecomprising the amino acid sequence as shown in any one of SEQ ID NOS:27-32 or 39-44; c) the mature form of a polypeptide comprising the aminoacid sequence as shown in any one of SEQ ID NOS: 27-32 or 39-44; d) apolypeptide comprising an amino acid sequence encoded by apolynucleotide which hybridizes under highly stringent conditions to apolynucleotide comprising a nucleotide sequence encoding (a) (b) or (c);and e) a polypeptide comprising an amino acid sequence having at least90% sequence identity to the polypeptide of (b). 2 An immunogeniccomposition comprising an immunologically effective amount of at leastone polypeptide of embodiment 1. 3 An isolated antibody thatspecifically binds the polypeptide of embodiment 1. 4 A method forstimulating the immune system of an individual to produce a protectiveimmune response against influenza virus, the method comprisingadministering to the individual an immunologically effective amount ofthe polypeptide of embodiment 1 in a physiologically acceptable carrier.5 A recombinant influenza virus comprising the polypeptide ofembodiment 1. 6 An immunogenic composition comprising an immunologicallyeffective amount of the recombinant influenza virus of embodiment 5. 7 Amethod for stimulating the immune system of an individual to produce aprotective immune response against influenza virus, the methodcomprising administering to the individual an immunologically effectiveamount of the recombinant influenza virus of embodiment 5 in aphysiologically acceptable carrier. 8 An isolated polynucleotide,wherein said polynucleotide is selected from the group consisting of: a)a polynucleotide comprising the nucleotide sequence as shown in any oneof SEQ ID NOS: 21-26 or 33-38 or 45, or a complementary sequencethereof; b) a polynucleotide comprising a nucleotide sequence encoding apolypeptide comprising the amino acid sequence as shown in any one ofSEQ ID NOS: 27-32 or 39-44, or a complementary nucleotide sequencethereof; c) a polynucleotide which hybridizes under highly stringentconditions over substantially the entire length of the polynucleotide of(a); and d) a polynucleotide comprising a nucleotide sequence having atleast 98% sequence identity to the polynucleotide of (a). 9 Animmunogenic composition comprising at least one polynucleotide ofembodiment 8. 10 A cell comprising at least one polynucleotide ofembodiment 8. 11 A vector comprising the polynucleotide of embodiment 8.12 The vector of embodiment 11, wherein the vector is a plasmid, acosmid, a phage, a virus, or a fragment of a virus. 13 The vector ofembodiment 12, wherein the vector is an expression vector. 14 A cellcomprising the vector of embodiment 13. 15 An influenza virus comprisingone or more polynucleotides of embodiment 8. 16 The virus of embodiment15, wherein the virus is a reassortant virus. 17 A 6:2 reassortantinfluenza virus, wherein said virus comprises 6 internal genome segmentsfrom A/Ann Arbor/6/60 and 2 genome segments that encode an HA and/or aNA polypeptide selected from the group consisting of: the polypeptidesof SEQ ID NOS: 27-32, and 39-44. 18 A method of producing a reassortantinfluenza virus, the method comprising: culturing the cell of embodiment14 in a suitable culture medium under conditions permitting expressionof said polynucleotide; and, isolating the reassortant influenza virusfrom a cell population comprising said cell or the medium. 19 Animmunogenic composition comprising an immunologically effective amountof the reassortant influenza virus of embodiment 17. 20 A method forstimulating the immune system of an individual to produce a protectiveimmune response against influenza virus, the method comprisingadministering to the individual an immunologically effective amount ofthe reassortant influenza virus of embodiment 17 in a physiologicallyeffective carrier. 21 A method of producing an isolated or recombinantpolypeptide, the method comprising: culturing the cell of embodiment 10in a suitable culture medium under conditions permitting expression ofsaid polynucleotide; and, isolating the polypeptide from the cell or themedium. 22 A method of prophylactic or therapeutic treatment of a viralinfection in a subject, the method comprising: administering to thesubject, the virus of embodiment 17 in an amount effective to produce animmunogenic response against the viral infection. 23 The method ofembodiment 22, wherein the subject is a human. 24 The immunogeniccomposition of embodiment 19, wherein the hemagglutinin comprises amodified polybasic cleavage site. 25 A live attenuated influenza vaccinecomprising the composition of embodiment 19. 26 A split virus or killedvirus vaccine comprising the composition of embodiment 19. 27 A liveattenuated influenza vaccine comprising the composition of embodiment24. 28 A split virus or killed virus vaccine comprising the compositionof embodiment 24. 29 A method for producing an influenza virus in cellculture, the method comprising: i) introducing into a population of hostcells, which population of host cells is capable of supportingreplication of influenza virus, a plurality of vectors comprisingnucleotide sequences corresponding to at least 6 internal genomesegments of A/Ann Arbor/6/60; and, at least one genome segmentcomprising a polynucleotide encoding an HA and/or a NA polypeptideselected from the group consisting of: the polypeptides of SEQ ID NOS:27-32, and 39-44, ii) culturing the population of host cells at atemperature less than or equal to 35° C.; and, iii) recovering aninfluenza virus. 30 The method of embodiment 29, wherein thepolynucleotide encoding the HA and/or NA polypeptide is selected fromthe group consisting of: a) a polynucleotide comprising the nucleotidesequence of any one of SEQ ID NOS: 21, 23-26 or 33-38, or 45, or acomplementary nucleotide sequence thereof; b) a polynucleotidecomprising a nucleotide sequence encoding a polypeptide comprising theamino acid sequence as shown in any one of SEQ ID NOS: 27-32 or 39-44,or a complementary nucleotide sequence thereof; c) a polynucleotidewhich hybridizes under highly stringent conditions over substantiallythe entire length of the polynucleotide of (a); and d) a polynucleotidecomprising a nucleotide sequence having at least 98% sequence identityto the polynucleotide of (a). 31 An immunogenic composition comprisingan immunologically effective amount of the influenza virus produced bythe method of embodiment 29. 32 An immunogenic composition comprising animmunologically effective amount of the influenza virus produced by themethod of embodiment 30. 33 A method for stimulating the immune systemof an individual to produce a protective immune response againstinfluenza virus, the method comprising administering to the individualan immunologically effective amount of the influenza virus produced bythe method of embodiment 29 in a physiologically effective carrier. 34 Amethod for stimulating the immune system of an individual to produce aprotective immune response against influenza virus, the methodcomprising administering to the individual an immunologically effectiveamount of the influenza virus produced by the method of embodiment 30 ina physiologically effective carrier. 35 A method for stimulating theimmune system of an individual to produce a protective immune responseagainst influenza virus, the method comprising administering to theindividual the immunogenic composition of embodiment 31. 36 A method forstimulating the immune system of an individual to produce a protectiveimmune response against influenza virus, the method comprisingadministering to the individual the immunogenic composition ofembodiment 32. 37 A live attenuated influenza vaccine comprising theimmunogenic composition of embodiment 31. 38 A split virus or killedvirus vaccine comprising the immunogenic composition of embodiment 32.39 A 6:2 reassortant influenza virus, wherein said virus comprises 6internal genome segments from one or more donor viruses other than A/AnnArbor/6/60 and 2 genome segments that encode an HA and/or a NApolypeptide selected from the group consisting of: the polypeptides ofSEQ ID NOS: 27-32, and 39-44. 40 The 6:2 reassortant influenza virus ofembodiment 39, wherein said donor virus comprises one or more of thefollowing phenotypes: temperature-sensitive, cold- adaped, orattenuated. 41 The 6:2 reassortant influenza virus of embodiment 39,wherein said donor virus is PR8. 42 The 6:2 reassortant influenza virusof embodiment 39, wherein said donor virus is A/Leningrad/17. 43 Animmunogenic composition comprising an immunologically effective amountof the reassortant influenza virus of embodiment 39. 44 An immunogeniccomposition comprising an immunologically effective amount of thereassortant influenza virus of embodiment 40. 45 An immunogeniccomposition comprising an immunologically effective amount of thereassortant influenza virus of embodiment 41. 46 An immunogeniccomposition comprising an immunologically effective amount of thereassortant influenza virus of embodiment 42. 47 A method forstimulating the immune system of an individual to produce a protectiveimmune response against influenza virus, the method comprisingadministering to the individual an immunologically effective amount ofthe reassortant influenza virus of embodiment 39 in a physiologicallyeffective carrier. 48 A method for stimulating the immune system of anindividual to produce a protective immune response against influenzavirus, the method comprising administering to the individual animmunologically effective amount of the reassortant influenza virus ofembodiment 40 in a physiologically effective carrier. 49 A method forstimulating the immune system of an individual to produce a protectiveimmune response against influenza virus, the method comprisingadministering to the individual an immunologically effective amount ofthe reassortant influenza virus of embodiment 41 in a physiologicallyeffective carrier. 50 A method for stimulating the immune system of anindividual to produce a protective immune response against influenzavirus, the method comprising administering to the individual animmunologically effective amount of the reassortant influenza virus ofembodiment 42 in a physiologically effective carrier. 51 A method ofprophylactic or therapeutic treatment of a viral infection in a subject,the method comprising: administering to the subject, the virus ofembodiment 39 in an amount effective to produce an immunogenic responseagainst the viral infection. 52 A method of prophylactic or therapeutictreatment of a viral infection in a subject, the method comprising:administering to the subject, the virus of embodiment 41 in an amounteffective to produce an immunogenic response against the viralinfection. 53 A method of prophylactic or therapeutic treatment of aviral infection in a subject, the method comprising: administering tothe subject, the virus of embodiment 42 in an amount effective toproduce an immunogenic response against the viral infection. 54 Themethod of embodiment 51, wherein said virus is killed or inactivated. 55The method of embodiment 52, wherein said virus is killed orinactivated. 56 The method of embodiment 53, wherein said virus iskilled or inactivated. 57 The immunogenic composition of embodiment 43,wherein the hemagglutinin comprises a modified polybasic cleavage site.58 The immunogenic composition of embodiment 44, wherein thehemagglutinin comprises a modified polybasic cleavage site. 59 Theimmunogenic composition of embodiment 45, wherein the hemagglutinincomprises a modified polybasic cleavage site 60 The immunogeniccomposition of embodiment 46, wherein the hemagglutinin comprises amodified polybasic cleavage site. 61 The method of embodiment 47,wherein the subject is a human. 62 The method of embodiment 48, whereinthe subject is a human. 63 The method of embodiment 49, wherein thesubject is a human. 64 A live attenuated influenza vaccine comprisingthe composition of embodiment 45. 65 A live attenuated influenza vaccinecomprising the composition of embodiment 46. 66 A method for producingan influenza virus in cell culture, the method comprising: i)introducing into a population of host cells, which population of hostcells is capable of supporting replication of influenza virus, aplurality of vectors comprising nucleotide sequences corresponding to:(a) at least 6 internal genome segments of a first influenza strain,wherein the first influenza strain is not A/Ann Arbor/6/60; and, atleast one genome segment encoding an HA or an NA polypeptide selectedfrom the group consisting of: the polypeptides of SEQ ID NOS: 27-32, and39-44; or (b) at least 6 internal genome segments of a first influenzastrain, wherein the first influenza strain is not A/Ann Arbor/6/60 andwhich influenza strain comprises one or more phenotypic attributesselected from the group consisting of: attenuated, cold adapted andtemperature sensitive; and, at least one genome segment encoding an HAor an NA polypeptide selected from the group consisting of: thepolypeptides of SEQ ID NOS: 27-32, and 39-44, ii) culturing thepopulation of host cells at a temperature less than or equal to 35° C.;and, iii) recovering an influenza virus. 67 An immunogenic compositioncomprising an immunologically effective amount of the influenza virusproduced by the method of embodiment 66. 68 A method for stimulating theimmune system of an individual to produce a protective immune responseagainst influenza virus, the method comprising administering to theindividual an immunologically effective amount of the influenza virusproduced by the method of embodiment 66 in a physiologically effectivecarrier. 69 A method for stimulating the immune system of an individualto produce a protective immune response against influenza virus, themethod comprising administering to the individual the immunogeniccomposition of embodiment 67. 70 A live attenuated influenza vaccinecomprising the immunogenic composition of embodiment 67. 71 A splitvirus or killed virus vaccine comprising the immunogenic composition ofembodiment 67.SEQUENCESca A/Vietnam/1203/04

Nucleotide Sequence of ca A/Vietnam/1203/04 H5(SEQ ID NO: 1) Entire molecule length: 1767 nt 1agcaaaagca ggggttcaat ctgtcaaaat ggagaaaata gtgcttcttt 51ttgcaatagt cagtcttgtt aaaagtgatc agatttgcat tggttaccat 101gcaaacaact cgacagagca ggttgacaca ataatggaaa agaacgttac 151tgttacacat gcccaagaca tactggaaaa gaaacacaac gggaagctct 201gcgatctaga tggagtgaag cctctaattt tgagagattg tagcgtagct 251ggatggctcc tcggaaaccc aatgtgtgac gaattcatca atgtgccgga 301atggtcttac atagtggaga aggccaatcc agtcaatgac ctctgttacc 351caggggattt caatgactat gaagaattga aacacctatt gagcagaata 401aaccattttg agaaaattca gatcatcccc aaaagttctt ggtccagtca 451tgaagcctca ttaggggtga gctcagcatg tccataccag ggaaagtcct 501cctttttcag aaatgtggta tggcttatca aaaagaacag tacataccca 551acaataaaga ggagctacaa taataccaac caagaagatc ttttggtact 601gtgggggatt caccatccta atgatgcggc agagcagaca aagctctatc 651aaaacccaac cacctatatt tccgttggga catcaacact aaaccagaga 701ttggtaccaa gaatagctac tagatccaaa gtaaacgggc aaagtggaag 751gatggagttc ttctggacaa ttttaaagcc gaatgatgca atcaacttcg 801agagtaatgg aaatttcatt gctccagaat atgcatacaa aattgtcaag 851aaaggggact caacaattat gaaaagtgaa ttggaatatg gtaactgcaa 901caccaagtgt caaactccaa tgggggcgat aaactctagc atgccattcc 951acaatataca ccctctcacc attggggaat gccccaaata tgtgaaatca 1001aacagattag tccttgcgac tgggctcaga aatagccctc aaagagagac 1051tcgaggatta tttggagcta tagcaggttt tatagaggga ggatggcagg 1101gaatggtaga tggttggtat gggtaccacc atagcaatga gcaggggagt 1151gggtacgctg cagacaaaga atccactcaa aaggcaatag atggagtcac 1201caataaggtc aactcgatca ttgacaaaat gaacactcag tttgaggccg 1251ttggaaggga atttaacaac ttagaaagga gaatagagaa tttaaacaag 1301aagatggaag acgggttcct agatgtctgg acttataatg ctgaacttct 1351ggttctcatg gaaaatgaga gaactctaga ctttcatgac tcaaatgtca 1401agaaccttta cgacaaggtc cgactacagc ttagggataa tgcaaaggag 1451ctgggtaacg gttgtttcga gttctatcat aaatgtgata atgaatgtat 1501ggaaagtgta agaaatggaa cgtatgacta cccgcagtat tcagaagaag 1551cgagactaaa aagagaggaa ataagtggag taaaattgga atcaatagga 1601atttaccaaa tactgtcaat ttattctaca gtggcgagtt ccctagcact 1651ggcaatcatg gtagctggtc tatccttatg gatgtgctcc aatgggtcgt 1701tacaatgcag aatttgcatt taaatttgtg agttcagatt gtagttaaaa 1751acacccttgt ttctact

Amino acid sequence of ca A/Vietnam/1203/04 H5(SEQ ID NO: 11) Entire molecule length: 564 aa 1mekivllfai vslvksdqic igyhannste qvdtimeknv tvthaqdile 51kkhngklcdl dgvkplilrd csvagwllgn pmcdefinvp ewsyivekan 101pvndlcypgd fndyeelkhl lsrinhfeki qiipksswss heaslgvssa 151cpyqgkssff rnvvwlikkn styptikrsy nntnqedllv lwgihhpnda 201aeqtklyqnp ttyisvgtst lnqrlvpria trskvngqsg rmeffwtilk 251pndainfesn gnfiapeyay kivkkgdsti mkseleygnc ntkcqtpmga 301inssmpfhni hpltigecpk yvksnrlvla tglrnspqre trglfgaiag 351fieggwqgmv dgwygyhhsn eqgsgyaadk estqkaidgv tnkvnsiidk 401mntqfeavgr efnnlerrie nlnkkmedgf ldvwtynael lvlmenertl 451dfhdsnvknl ydkvrlqlrd nakelgngcf efyhkcdnec mesvrngtyd 501ypqyseearl kreeisgvkl esigiyqils iystvassla laimvaglsl 551wmcsngslqc rici

Nucleotide Sequence of ca A/Vietnam/1203/04 N1(SEQ ID NO: 2) Entire molecule length: 1398 nt 1agcaaaagca ggagttcaaa atgaatccaa atcagaagat aataaccatc 51gggtcaatct gtatggtaac tggaatagtt agcttaatgt tacaaattgg 101gaacatgatc tcaatatggg tcagtcattc aattcacaca gggaatcaac 151accaatctga accaatcagc aatactaatt ttcttactga gaaagctgtg 201gcttcagtaa aattagcggg caattcatct ctttgcccca ttaacggatg 251ggctgtatac agtaaggaca acagtataag gatcggttcc aagggggatg 301tgtttgttat aagagagccg ttcatctcat gctcccactt ggaatgcaga 351actttctttt tgactcaggg agccttgctg aatgacaagc actccaatgg 401gactgtcaaa gacagaagcc ctcacagaac attaatgagt tgtcctgtgg 451gtgaggctcc ctccccatat aactcaaggt ttgagtctgt tgcttggtca 501gcaagtgctt gccatgatgg caccagttgg ttgacgattg gaatttctgg 551cccagacaat ggggctgtgg ctgtattgaa atacaatggc ataataacag 601acactatcaa gagttggagg aacaacatac tgagaactca agagtctgaa 651tgtgcatgtg taaatggctc ttgctttact gtaatgactg acggaccaag 701taatggtcag gcatcacata agatcttcaa aatggaaaaa gggaaagtgg 751ttaaatcagt cgaattggat gctcctaatt atcactatga ggaatgctcc 801tgttatccta atgccggaga aatcacatgt gtgtgcaggg ataattggca 851tggctcaaat cggccatggg tatctttcaa tcaaaatttg gagtatcaaa 901taggatatat atgcagtgga gttttcggag acaatccacg ccccaatgat 951ggaacaggta gttgtggtcc ggtgtcctct aacggggcat atggggtaaa 1001agggttttca tttaaatacg gcaatggtgt ctggatcggg agaaccaaaa 1051gcactaattc caggagcggc tttgaaatga tttgggatcc aaatgggtgg 1101actgaaacgg acagtagctt ttcagtgaaa caagatatcg tagcaataac 1151tgattggtca ggatatagcg ggagttttgt ccagcatcca gaactgacag 1201gactagattg cataagacct tgtttctggg ttgagttgat cagagggcgg 1251cccaaagaga gcacaatttg gactagtggg agcagcatat ctttttgtgg 1301tgtaaatagt gacactgtgg gttggtcttg gccagacggt gctgagttgc 1351cattcaccat tgacaagtag tttgttcaaa aaactccttg tttctact

Amino acid sequence of ca A/Vietnam/1203/04 N1(SEQ ID NO: 12) Entire molecule length: 449 aa 1mnpnqkiiti gsicmvtgiv slmlqignmi siwvshsiht gnqhqsepis 51ntnfltekav asvklagnss lcpingwavy skdnsirigs kgdvfvirep 101fiscshlecr tffltqgall ndkhsngtvk drsphrtlms cpvgeapspy 151nsrfesvaws asachdgtsw ltigisgpdn gavavlkyng iitdtikswr 201nnilrtqese cacvngscft vmtdgpsngq ashkifkmek gkvvksveld 251apnyhyeecs cypnageitc vcrdnwhgsn rpwvsfnqnl eyqigyicsg 301vfgdnprpnd gtgscgpvss ngaygvkgfs fkygngvwig rtkstnsrsg 351femiwdpngw tetdssfsvk qdivaitdws gysgsfvqhp eltgldcirp 401cfwvelirgr pkestiwtsg ssisfcgvns dtvgwswpdg aelpftidkCa A/Hong Kong/213/03

Nucleotide Sequence of ca A/Hong Kong/213/03 H5(SEQ ID NO: 3) Entire molecule length: 1767 nt 1agcaaaagca ggggttcaat ctgtcaaaat ggagaaaata gtgcttcttt 51ttgcaatagt cagtcttgtt aaaagtgatc agatttgcat tggttaccat 101gcaaacaact cgacagagca ggttgacaca ataatggaaa agaacgttac 151tgttacacat gcccaagaca tactggaaaa gacacacaac gggaagctct 201gcgatctaga tggagtgaag cctctaattt tgagagattg tagtgtagct 251ggatggctcc tcggaaaccc aatgtgtgac gaattcatca atgtgccgga 301atggtcttac atagtggaga aggccaatcc agccaatgac ctctgttacc 351caggggattt caacgactat gaagaattga aacacctatt gagcagaata 401aaccattttg agaaaattca gatcatcccc aaaaattctt ggtccagtca 451tgaagcctca ttaggggtga gctcagcatg tccataccaa ggaaagtcct 501cctttttcag gaatgtggta tggcttatca aaaagaacaa tgcataccca 551acaataaaga ggagctacaa taataccaac caagaagatc ttttggtatt 601gtgggggatt caccatccta atgatgcggc agagcagact aggctctatc 651aaaacccaac cacctacatt tccgttggga catcaacact aaaccagaga 701ttggtaccaa aaatagctac tagatccaaa gtaaacgggc aaaatggaag 751gatggagttc ttctggacaa ttttaaaacc gaatgatgca atcaacttcg 801agagcaatgg aaatttcatt gctccagaat atgcatacaa aattgtcaag 851aaaggggact cagcaattat gaaaagtgaa ttggaatatg gtaactgcaa 901caccaagtgt caaactccaa tgggggcgat aaactctagt atgccattcc 951acaatataca ccctctcacc atcggggaat gccccaaata tgtgaaatca 1001aacagattag tccttgcgac tgggctcaga aatagccctc aaagagagac 1051tcgaggatta tttggagcta tagcaggttt tatagaggga ggatggcagg 1101gaatggtaga tggttggtat gggtaccacc atagcaatga gcaggggagt 1151gggtacgctg cagacaaaga atccactcaa aaggcaatag atggagtcac 1201caataaggtc aactcgatca ttgacaaaat gaacactcag tttgaggccg 1251ttggaaggga atttaataac ttagaaagga gaatagagaa tttaaacaag 1301aagatggaag acggattcct agatgtctgg acttataatg ctgaacttct 1351ggttctcatg gaaaatgaga gaactctaga ctttcatgac tcaaatgtca 1401agaaccttta cgacaaggtc cgactacagc ttagggataa tgcaaaggag 1451ctgggtaacg gttgtttcga gttctatcac aaatgtgata atgaatgtat 1501ggaaagtgta agaaacggaa cgtatgacta cccgcagtat tcagaagaag 1551caagactaaa aagagaggaa ataagtggag taaaattgga gtcaatagga 1601acttaccaaa tactgtcaat ttattctaca gtggcgagtt ccctagcact 1651ggcaatcatg gtagctggtc tatctttatg gatgtgctcc aatgggtcgt 1701tacaatgcag aatttgcatt taaatttgtg agttcagatt gtagttaaaa 1751acacccttgt ttctact

Amino acid sequence of ca A/Hong Kong/213/03 H5(SEQ ID NO: 13) Entire molecule length: 564 aa 1mekivllfai vslvksdqic igyhannste qvdtimeknv tvthaqdile 51kthngklcdl dgvkplilrd csvagwllgn pmcdefinvp ewsyivekan 101pandlcypgd fndyeelkhl lsrinhfeki qiipknswss heaslgvssa 151cpyqgkssff rnvvwlikkn nayptikrsy nntnqedllv lwgihhpnda 201aeqtrlyqnp ttyisvgtst lnqrlvpkia trskvngqng rmeffwtilk 251pndainfesn gnfiapeyay kivkkgdsai mkseleygnc ntkcqtpmga 301inssmpfhni hpltigecpk yvksnrlvla tglrnspqre trglfgaiag 351fieggwqgmv dgwygyhhsn eqgsgyaadk estqkaidgv tnkvnsiidk 401mntqfeavgr efnnlerrie nlnkkmedgf ldvwtynael lvlmenertl 451dfhdsnvknl ydkvrlqlrd nakelgngcf efyhkcdnec mesvrngtyd 501ypqyseearl kreeisgvkl esigtyqils iystvassla laimvaglsl 551wmcsngslqc rici

Nucleotide Sequence of ca A/Hong Kong/213/03 N1(SEQ ID NO: 4) Entire molecule length: 1458 nt 1agcaaaagca ggagttcaaa atgaatccaa atcagaagat aacaaccatt 51ggatcaatct gtatggtaat tggaatagtt agcttgatgt tacaaattgg 101gaacataatc tcaatatggg ttagtcattc aattcaaaca gggaatcaac 151accaggctga accatgcaat caaagcatta ttacttatga aaacaacacc 201tgggtaaacc agacatatgt caacatcagc aataccaatt ttcttactga 251gaaagctgtg gcttcagtaa cattagcggg caattcatct ctttgcccca 301ttagtggatg ggctgtatac agtaaggaca acggtataag aatcggttcc 351aagggggatg tgtttgttat aagagagccg ttcatctcat gctcccactt 401ggaatgcaga actttctttt tgactcaggg agccttgctg aatgacaagc 451attctaatgg gaccgtcaaa gacagaagcc ctcacagaac attaatgagt 501tgtcccgtgg gtgaggctcc ttccccatac aactcgaggt ttgagtctgt 551tgcttggtcg gcaagtgctt gtcatgatgg cactagttgg ttgacaattg 601gaatttctgg cccagacaat ggggctgtgg ctgtattgaa atacaatggc 651ataataacag acactatcaa gagttggagg aacaacataa tgagaactca 701agagtctgaa tgtgcatgtg taaatggctc ttgctttact gttatgactg 751atggaccaag taatgggcag gcttcataca aaatcttcag aatagaaaaa 801gggaaagtag ttaaatcagc cgaattaaat gcccctaatt atcactatga 851ggagtgctcc tgttatcctg atgctggaga aatcacatgt gtgtgcaggg 901ataactggca tggctcaaat cggccatggg tatctttcaa tcaaaatttg 951gagtatcgaa taggatatat atgcagtgga gttttcggag acaatccacg 1001ccccaatgat gggacaggca gttgtggtcc ggtgtcccct aaaggggcat 1051atggaataaa agggttctca tttaaatacg gcaatggtgt ttggatcggg 1101agaaccaaaa gcactaattc caggagcggc tttgaaatga tttgggatcc 1151aaatggatgg actggtacgg acagtaattt ttcagtaaag caagatattg 1201tagctataac cgattggtca ggatatagcg ggagttttgt ccagcatcca 1251gaactgacag gattagattg cataagacct tgtttctggg ttgagctaat 1301cagagggcgg cccaaagaga gcacaatttg gactagtggg agcagcatat 1351ccttttgtgg tgtaaatagt gacactgtgg gttggtcttg gccagacggt 1401gctgagttgc cattcaccat tgacaagtag tttgttcaaa aaactccttg 1451 tttctact

Amino acid sequence of ca A/Hong Kong/213/03 N1(SEQ ID NO: 14) Entire molecule length: 469 aa 1mnpnqkitti gsicmvigiv slmlqignii siwvshsiqt gnqhqaepcn 51qsiityennt wvnqtyvnis ntnfltekav asvtlagnss lcpisgwavy 101skdngirigs kgdvfvirep fiscshlecr tffltqgall ndkhsngtvk 151drsphrtlms cpvgeapspy nsrfesvaws asachdgtsw ltigisgpdn 201gavavlkyng iitdtikswr nnimrtqese cacvngscft vmtdgpsngq 251asykifriek gkvvksaeln apnyhyeecs cypdageitc vcrdnwhgsn 301rpwvsfnqnl eyrigyicsg vfgdnprpnd gtgscgpvsp kgaygikgfs 351fkygngvwig rtkstnsrsg femiwdpngw tgtdsnfsvk qdivaitdws 401gysgsfvqhp eltgldcirp cfwvelirgr pkestiwtsg ssisfcgvns 451dtvgwswpdg aelpftidkca A/Hong Kong/491/97 (HA)+A/Hong Kong/486/97 (NA)

Nucleotide Sequence of ca A/Hong Kong/491/97 H5(SEQ ID NO: 5) Entire molecule length: 1767 nt 1agcaaaagca ggggtataat ctgtcaaaat ggagaaaata gtgcttcttc 51ttgcaacagt cagccttgtt aaaagtgacc agatttgcat tggttaccat 101gcaaacaact cgacagagca agttgacaca ataatggaaa agaatgttac 151tgttacacat gcccaagaca tactggaaag gacacacaac gggaagctct 201gcgatctaaa tggagtgaag cctctgattt tgagggattg tagtgtagct 251ggatggctcc tcggaaaccc tatgtgtgac gaattcatca atgtgccgga 301atggtcttac atagtggaga aggccagtcc agccaatgac ctctgttatc 351cagggaattt caacgactat gaagaactga aacacctatt gagcagaata 401aaccattttg agaaaattca gataatcccc aaaagttctt ggtccaatca 451tgatgcctca tcaggggtga gctcagcatg tccatacctt gggaggtcct 501cctttttcag aaatgtggta tggcttatca aaaagaacag tagctaccca 551acaataaaga ggagctacaa taataccaac caagaagatc ttttggtact 601gtgggggatt caccatccta atgatgcggc agagcagaca aggctctatc 651aaaacccaac cacctacatt tccgttggaa catcaacact gaaccagaga 701ttggttccag aaatagctac tagacccaaa gtaaacgggc aaagtggaag 751aatggagttc ttctggacaa ttttaaagcc gaatgatgcc atcaatttcg 801agagtaatgg aaatttcatt gctccagaat atgcatacaa aattgtcaag 851aaaggggact caacaattat gaaaagtgaa ttggaatatg gtaactgcaa 901caccaagtgt caaactccaa tgggggcaat aaactctagt atgccattcc 951acaacataca ccccctcacc atcggggaat gccccaaata tgtgaaatca 1001aacagattag tccttgcaac tggactcaga aatacccctc aacgagagac 1051gcgaggacta tttggagcta tagcaggttt tatagaggga ggatggcagg 1101gaatggtaga tggttggtat gggtaccacc atagcaatga gcaggggagt 1151ggatacgctg cagaccaaga atccacacaa aaggcaatag atggagtcac 1201caataaggtc aactcgatca ttaacaaaat gaacactcag tttgaggccg 1251ttggaaggga atttaataac ttggaaagga ggatagagaa tttaaacaag 1301aaaatggaag acggattcct agatgtctgg acttacaatg ccgaacttct 1351ggttctcatg gaaaatgaga gaactctaga ctttcatgac tcaaatgtca 1401agaaccttta cgacaaggtc cgactacagc ttagggataa tgcaaaggag 1451ctgggtaatg gttgtttcga attctatcac aaatgtgata acgaatgtat 1501ggaaagtgta aaaaacggaa cgtatgacta cccgcagtat tcagaagaag 1551caagactaaa cagagaggaa ataagtggag taaaattgga atcaatggga 1601acttaccaaa tactgtcaat ttattcaaca gtggcgagtt ccctagcact 1651ggcaatcatg gtagctggtc tatctttatg gatgtgctcc aatggatcgt 1701tacaatgcag aatttgcatt taaatttgtg agttcagatt gtagttaaaa 1751acacccttgt ttctact

Amino acid sequence of ca A/Hong Kong/491/97 H5 (SEQ ID NO: 15) Entiremolecule length: 564 aa 1 mekivlllat vslvksdqic igyhannste qvdtimeknvtvthaqdile 51 rthngklcdl ngvkplilrd csvagwllgn pmcdefinvp ewsyivekas 101pandlcypgn fndyeelkhl lsrinhfeki qiipksswsn hdassgvssa 151 cpylgrssffrnvvwlikkn ssyptikrsy nntnqedllv lwgihhpnda 201 aeqtrlyqnp ttyisvgtstlnqrlvpeia trpkvngqsg rmeffwtilk 251 pndainfesn gnfiapeyay kivkkgdstimkseleygnc ntkcqtpmga 301 inssmpfhni hpltigecpk yvksnrlvla tglrntpqretrglfgaiag 351 fieggwqgmv dgwygyhhsn eqgsgyaadq estqkaidgv tnkvnsiink401 mntqfeavgr efnnlerrie nlnkkmedgf ldvwtynael lvlmenertl 451dfhdsnvknl ydkvrlqlrd nakelgngcf efyhkcdnec mesvkngtyd 501 ypqyseearlnreeisgvkl esmgtyqils iystvassla laimvaglsl 551 wmcsngslqc rici

Nucleotide Sequence of ca A/Hong Kong/486/97 N1 (SEQ ID NO: 6) Entiremolecule length: 1401 nt 1 agcaaaagca ggagtttaaa atgaatccaa atcagaagataataaccatt 51 ggatcaatct gcatggtagt tgggataatc agcttgatgt tacaaattgg 101aaacacaata tcagtatggg tcagccacat aattaaaact tggcacccaa 151 accagcctgaaccatgcaac caaagcatca atttttacac tgagcaggct 201 gcagcttcag tgacattagcgggcaattcc tctctctgcc ctattagtgg 251 atgggctata tacagcaagg acaatagtataagaattggt tccaaagggg 301 atgtgtttgt tataagagaa ccattcatct catgctcccatttggaatgc 351 agaacctttt tcttgaccca aggagcccta ttgaatgaca agcattctaa401 tgggaccgtc aaagacagga gcccctatag aactttaatg agctgtcctg 451ttggtgaggc cccttcccca tacaactcaa ggtttgagtc tgttgcttgg 501 tcagcaagtgcttgccatga tggcattagt tggctaacaa ttggaatttc 551 cggtccggat aatggggctgtggctgtgtt gaaatacaat ggcataataa 601 cagacaccat caagagttgg aggaacaacacactgaggac gcaagagtct 651 gaatgtgcat gtgtgaatgg ttcttgtttt actgtaatgacagatggacc 701 gagtaatgaa caggcctcat acaagatttt caagatagaa aaggggaggg751 tagtcaaatc agttgagttg aacgccccta attatcatta cgaggaatgc 801tcctgttatc ctgatgctgg cgaaatcaca tgtgtgtgca gggataattg 851 gcatggctcgaaccgaccat gggtgtcttt caatcagaat ctggagtatc 901 aaataggata tatatgcagtggggttttcg gagacagtcc acgccccaat 951 gatgggacag gcagttgtgg tccagtgtctcttaacggag cgtatggagt 1001 aaaagggttt tcatttaaat acggcaatgg tgtttggatcgggagaacca 1051 aaagcactag ttccaggagc ggttttgaaa tgatttggga tccaaatggg1101 tggaccgaaa cagacagtag cttctcgttg aagcaagaca tcatagcgat 1151aactgattgg tcaggataca gcgggagttt tattcaacat ccagaactga 1201 caggattaaattgcatgaga ccttgcttct gggttgaact aatcagaggg 1251 aggcccaaag agaaaacaatctggactagt gggagcagta tatctttctg 1301 tggtgtaaat agtgacactg tgggttggtcttggccagac ggtgctgagt 1351 tgccatacac cattgacaag tagtttgttc aaaaaactccttgtttctac 1401 t

Amino acid sequence of ca A/Hong Kong/486/97 N1 (SEQ ID NO: 16) Entiremolecule length: 450 aa 1 mnpnqkiiti gsicmvvgii slmlqignti svwvshiiktwhpnqpepcn 51 qsinfyteqa aasvtlagns slcpisgwai yskdnsirig skgdvfvire 101pfiscshlec rtffltqgal lndkhsngtv kdrspyrtlm scpvgeapsp 151 ynsrfesvawsasachdgis wltigisgpd ngavavlkyn giitdtiksw 201 rnntlrtqes ecacvngscftvmtdgpsne qasykifkie kgrvvksvel 251 napnyhyeec scypdageit cvcrdnwhgsnrpwvsfnqn leyqigyics 301 gvfgdsprpn dgtgscgpvs lngaygvkgf sfkygngvwigrtkstssrs 351 gfemiwdpng wtetdssfsl kqdiiaitdw sgysgsfiqh peltglncmr401 pcfwvelirg rpkektiwts gssisfcgvn sdtvgwswpd gaelpytidkca A/Hong Kong/491/97 (Ser211) (HA)+ca A/Hong Kong/486/97 (NA)

Nucleotide Sequence of ca A/Hong Kong/491/97 (Ser211) H5 (SEQ ID NO: 7)Entire molecule length: 1767 nt 1 agcaaaagca ggggtataat ctgtcaaaatggagaaaata gtgcttcttc 51 ttgcaacagt cagccttgtt aaaagtgacc agatttgcattggttaccat 101 gcaaacaact cgacagagca agttgacaca ataatggaaa agaatgttac151 tgttacacat gcccaagaca tactggaaag gacacacaac gggaagctct 201gcgatctaaa tggagtgaag cctctgattt tgagggattg tagtgtagct 251 ggatggctcctcggaaaccc tatgtgtgac gaattcatca atgtgccgga 301 atggtcttac atagtggagaaggccagtcc agccaatgac ctctgttatc 351 cagggaattt caacgactat gaagaactgaaacacctatt gagcagaata 401 aaccattttg agaaaattca gataatcccc aaaagttcttggtccaatca 451 tgatgcctca tcaggggtga gctcagcatg tccatacctt gggaggtcct501 cctttttcag aaatgtggta tggcttatca aaaagaacag tagctaccca 551acaataaaga ggagctacaa taataccaac caagaagatc ttttggtact 601 gtgggggattcaccatccta atgatgcggc agagcagaca aggctctatc 651 aaaacccaac cacctacatttccgttggaa catcaacact gaaccagaga 701 ttggtttcag aaatagctac tagacccaaagtaaacgggc aaagtggaag 751 aatggagttc ttctggacaa ttttaaagcc gaatgatgccatcaatttcg 801 agagtaatgg aaatttcatt gctccagaat atgcatacaa aattgtcaag851 aaaggggact caacaattat gaaaagtgaa ttggaatatg gtaactgcaa 901caccaagtgt caaactccaa tgggggcaat aaactctagt atgccattcc 951 acaacatacaccccctcacc atcggggaat gccccaaata tgtgaaatca 1001 aacagattag tccttgcaactggactcaga aatacccctc aacgagagac 1051 gcgaggacta tttggagcta tagcaggttttatagaggga ggatggcagg 1101 gaatggtaga tggttggtat gggtaccacc atagcaatgagcaggggagt 1151 ggatacgctg cagaccaaga atccacacaa aaggcaatag atggagtcac1201 caataaggtc aactcgatca ttaacaaaat gaacactcag tttgaggccg 1251ttggaaggga atttaataac ttggaaagga ggatagagaa tttaaacaag 1301 aaaatggaagacggattcct agatgtctgg acttacaatg ccgaacttct 1351 ggttctcatg gaaaatgagagaactctaga ctttcatgac tcaaatgtca 1401 agaaccttta cgacaaggtc cgactacagcttagggataa tgcaaaggag 1451 ctgggtaatg gttgtttcga attctatcac aaatgtgataacgaatgtat 1501 ggaaagtgta aaaaacggaa cgtatgacta cccgcagtat tcagaagaag1551 caagactaaa cagagaggaa ataagtggag taaaattgga atcaatggga 1601acttaccaaa tactgtcaat ttattcaaca gtggcgagtt ccctagcact 1651 ggcaatcatggtagctggtc tatctttatg gatgtgctcc aatggatcgt 1701 tacaatgcag aatttgcatttaaatttgtg agttcagatt gtagttaaaa 1751 acacccttgt ttctact

Amino acid sequence of ca A/Hong Kong/491/97 (Ser211) H5 (SEQ ID NO: 17)Entire molecule length: 564 aa 1 mekivlllat vslvksdqic igyhannsteqvdtimeknv tvthaqdile 51 rthngklcdl ngvkplilrd csvagwllgn pmcdefinvpewsyivekas 101 pandlcypgn fndyeelkhl lsrinhfeki qiipksswsn hdassgvssa151 cpylgrssff rnvvwlikkn ssyptikrsy nntnqedllv lwgihhpnda 201aeqtrlyqnp ttyisvgtst lnqrlvseia trpkvngqsg rmeffwtilk 251 pndainfesngnfiapeyay kivkkgdsti mkseleygnc ntkcqtpmga 301 inssmpfhni hpltigecpkyvksnrlvla tglrntpqre trglfgaiag 351 fieggwqgmv dgwygyhhsn eqgsgyaadqestqkaidgv tnkvnsiink 401 mntqfeavgr efnnlerrie nlnkkmedgf ldvwtynaellvlmenertl 451 dfhdsnvknl ydkvrlqlrd nakelgngcf efyhkcdnec mesvkngtyd501 ypqyseearl nreeisgvkl esmgtyqils iystvassla laimvaglsl 551wmcsngslqc rici

Nucleotide Sequence of ca A/Hong Kong/486/97 N1 (SEQ ID NO: 8) Entiremolecule length: 1401 nt 1 agcaaaagca ggagtttaaa atgaatccaa atcagaagataataaccatt 51 ggatcaatct gcatggtagt tgggataatc agcttgatgt tacaaattgg 101aaacacaata tcagtatggg tcagccacat aattaaaact tggcacccaa 151 accagcctgaaccatgcaac caaagcatca atttttacac tgagcaggct 201 gcagcttcag tgacattagcgggcaattcc tctctctgcc ctattagtgg 251 atgggctata tacagcaagg acaatagtataagaattggt tccaaagggg 301 atgtgtttgt tataagagaa ccattcatct catgctcccatttggaatgc 351 agaacctttt tcttgaccca aggagcccta ttgaatgaca agcattctaa401 tgggaccgtc aaagacagga gcccctatag aactttaatg agctgtcctg 451ttggtgaggc cccttcccca tacaactcaa ggtttgagtc tgttgcttgg 501 tcagcaagtgcttgccatga tggcattagt tggctaacaa ttggaatttc 551 cggtccggat aatggggctgtggctgtgtt gaaatacaat ggcataataa 601 cagacaccat caagagttgg aggaacaacacactgaggac gcaagagtct 651 gaatgtgcat gtgtgaatgg ttcttgtttt actgtaatgacagatggacc 701 gagtaatgaa caggcctcat acaagatttt caagatagaa aaggggaggg751 tagtcaaatc agttgagttg aacgccccta attatcatta cgaggaatgc 801tcctgttatc ctgatgctgg cgaaatcaca tgtgtgtgca gggataattg 851 gcatggctcgaaccgaccat gggtgtcttt caatcagaat ctggagtatc 901 aaataggata tatatgcagtggggttttcg gagacagtcc acgccccaat 951 gatgggacag gcagttgtgg tccagtgtctcttaacggag cgtatggagt 1001 aaaagggttt tcatttaaat acggcaatgg tgtttggatcgggagaacca 1051 aaagcactag ttccaggagc ggttttgaaa tgatttggga tccaaatggg1101 tggaccgaaa cagacagtag cttctcgttg aagcaagaca tcatagcgat 1151aactgattgg tcaggataca gcgggagttt tattcaacat ccagaactga 1201 caggattaaattgcatgaga ccttgcttct gggttgaact aatcagaggg 1251 aggcccaaag agaaaacaatctggactagt gggagcagta tatctttctg 1301 tggtgtaaat agtgacactg tgggttggtcttggccagac ggtgctgagt 1351 tgccatacac cattgacaag tagtttgttc aaaaaactccttgtttctac 1401 t

Amino acid sequence of ca A/Hong Kong/486/97 N1 (SEQ ID NO: 18) Entiremolecule length: 450 aa 1 mnpnqkiiti gsicmvvgii slmlqignti svwvshiiktwhpnqpepcn 51 qsinfyteqa aasvtlagns slcpisgwai yskdnsirig skgdvfvire 101pfiscshlec rtffltqgal lndkhsngtv kdrspyrtlm scpvgeapsp 151 ynsrfesvawsasachdgis wltigisgpd ngavavlkyn giitdtiksw 201 rnntlrtqes ecacvngscftvmtdgpsne qasykifkie kgrvvksvel 251 napnyhyeec scypdageit cvcrdnwhgsnrpwvsfnqn leyqigyics 301 gvfgdsprpn dgtgscgpvs lngaygvkgf sfkygngvwigrtkstssrs 351 gfemiwdpng wtetdssfsl kqdiiaitdw sgysgsfiqh peltglncmr401 pcfwvelirg rpkektiwts gssisfcgvn sdtvgwswpd gaelpytidkca A/ck/Hong Kong/G9/97

Nucleotide sequence of ca A/ck/Hong Kong/G9/97 (SEQ ID NO: 9) Entiremolecule length: 1690 bp 1 ttaaccactc aagatggaag caataccact aataactatactactagtag 51 taacagcaag caatgcagac aaaatctgca tcggctacca atcaacaaac 101tccacagaaa ccgtagacac gctaacagaa aacaatgttc ctgtgacaca 151 tgccaaagaattgctccaca cagagcacaa tgggatgctg tgtgcaacaa 201 atctgggacg tcctcttattctagacactt gcaccattga aggactgatc 251 tatggcaacc cttcttgtga tctactgttgggaggaagag aatggtccta 301 catcgtcgaa agaccatcgg ctgttaatgg aatgtgttaccccgggaatg 351 tagaaaacct agaggaacta aggtcatttt ttagttctgc tagttcctac401 caaagaatcc agatctttcc agacacaatc tggaatgtgt cttacagtgg 451aacaagcaaa gcatgttcag attcattcta caggagcatg agatggttga 501 ctcaaaagaacaacgcttac cctattcaag acgcccaata cacaaataat 551 agaggaaaga gcattcttttcatgtggggc ataaatcacc cacctaccga 601 tactgcacag acaaatctgt acacaaggactgacacaaca acaagtgtgg 651 caacagaaga tataaatagg accttcaaac cagtgatagggccaaggccc 701 cttgtcaatg gtctgcaggg aagaattgat tattattggt cggtattgaa751 accaggtcag acattgcgag taagatccaa tgggaatcta atcgctccat 801ggtatgggca cattctttca ggagagagcc acggaagaat cctgaagact 851 gatttaaacagtggtagctg tgtagtgcaa tgtcaaacag aaagaggtgg 901 cttaaatact actttgccattccacaatgt cagtaaatat gcatttggaa 951 actgcccaaa atatgttgga gtaaagagtctcaaactggc agttggtctg 1001 aggaatgtgc ctgctagatc aagtagagga ctatttggggccatagctgg 1051 attcatagag ggaggttggt cagggctggt cgctggttgg tatgggttcc1101 agcattcaaa tgatcaaggg gttggtatag ctgcagatag agactcaact 1151caaagggcaa ttgacaaaat aacgtccaaa gtgaataata tagtcgataa 1201 aatgaacaagcaatatgaaa ttattgatca tgaattcagc gaggttgaaa 1251 atagactcaa tatgatcaataataagattg atgaccagat acaagacata 1301 tgggcatata acgctgaatt gctagtgctgcttgaaaacc agaaaacact 1351 cgatgagcat gatgcgaatg taaacaatct atataacaaagtgaagaggg 1401 cactgggttc caatgcaatg gaagatggga aaggatgttt cgagctatac1451 cataaatgtg atgatcagtg catggagaca attcggaacg ggacctataa 1501caggaggaag tataaagagg aatcaagact agaaagacag aaaatagaag 1551 gggtcaagctggaatctgaa ggaacttaca aaatcctcac catttattcg 1601 actgtcgcct catctcttgtgattgcaatg gggtttgctg ccttcttgtt 1651 ctgggccatg tccaatggat cttgcagatgcaacatttga

Amino acid sequence of ca A/ck/Hong Kong/G9/97 H9 (SEQ ID NO: 19) Entiremolecule length: 558 aa 1 meaipiltil lvvtasnadk icigyqstns tetvdtltennvpvthakel 51 lhtehngmlc atnlgrplil dtctiegliy gnpscdlllg grewsyiver 101psavngmcyp gnvenleelr sffssassyq riqifpdtiw nvsysgtska 151 csdsfyrsmrwltqknnayp iqdaqytnnr gksilfmwgi nhpptdtaqt 201 nlytrtdttt svatedinrtfkpvigprpl vnglqgridy ywsvlkpgqt 251 lrvrsngnli apwyghilsg eshgrilktdlnsgscvvqc qtergglntt 301 lpfhnvskya fgncpkyvgv kslklavglr nvparssrglfgaiagfieg 351 gwsglvagwy gfqhsndqgv giaadrdstq raidkitskv nnivdkmnkq401 yeiidhefse venrlnminn kiddqiqdiw aynaellvll enqktldehd 451anvnnlynkv kralgsname dgkgcfelyh kcddqcmeti rngtynrrky 501 keesrlerqkiegvkleseg tykiltiyst vasslviamg faaflfwams 551 ngscrcni

Nucleotide sequence of ca A/ck/Hong Kong/G9/97 N2 (SEQ ID NO: 10) Entiremolecule length: 1428 bp 1 aaatgaatcc aaatcagaag ataatagcaa ttggctctgtttctctaact 51 attgcgacaa tatgcctcct catgcagatt gctatcttag caacgactat 101gacactacat ttcaagcaga atgaatgcat caactcctcg aataatcaag 151 tagtgccatgtgaaccaatc ataatagaaa ggaacataac agagatagtg 201 catttgaata gtactaccttagagaaggaa atttgtccta aagtagcaga 251 ctacaggaat tggtcaaaac cacaatgtcaaatcacaggg ttcgctcctt 301 tctccaagga caattcaatt aggctctccg caggtggagatatttgggtg 351 acaagagaac cttatgtatc gtgcggtctt ggtaaatgtt atcaatttgc401 acttgggcag ggaaccactt tggagaacaa acactcaaac ggcacagcac 451atgatagaac tcctcataga acccttttaa tgaatgagtt gggtgttccg 501 tttcatttggcaaccaaaca agtgtgcata gcatggtcca gctcaagctg 551 ccatgatggg aaagcatggttacatgtttg tgtcactggg gatgatagaa 601 atgcaacggc tagcatcatt tatgatgggatacttgttga cagtattggt 651 tcatggtcta aaaacatcct cagaactcag gagtcagaatgcgtttgcat 701 caatggaacc tgtgcagtag taatgactga tggaagtgca tcaggaaggg751 ctgacactag aatactattt attagagagg ggaaaattgc acacattagc 801ccattgtcag gaagtgctca gcatgtggag gaatgctcct gttacccccg 851 atatccagaagttagatgtg tttgcagaga caattggaag ggatccaata 901 ggcccgttct atatataaatatggcaaatt atagtattga ttccagttat 951 gtgtgctcag gacttgttgg cgacacaccaagaaatgatg ataggtctag 1001 cagcagcaac tgcagagatc ctaataacga gagaggggccccaggagtaa 1051 aagggtgggc ctttgacaat ggaaatgaca tttggatggg aagaacaatc1101 aaaaaggatt cgcgctcagg ttatgagact ttcagggtca ttggtggttg 1151gaccactgct aattccaagt cacagataaa tagacaagtc atagttgaca 1201 gtgataactcgtctgggtat tctggtatct tctctgttga aggcaaaagc 1251 tgcatcaaca ggtgtttttacgtggagttg ataagaggaa gaccaaagga 1301 gactagggtg tggtggactt caaatagcatcattgtattt tgtggaactt 1351 caggtaccta tggaacaggc tcatggcctg atggggcgaatatcaatttc 1401 atgcctatat aagctttcgc aattttag

Amino acid sequence of ca A/ck/Hong Kong/G9/97 N2 (SEQ ID NO: 20) Entiremolecule length: 469 aa 1 mnpnqkiiai gsvsltiati cllmqiaila ttmtlhfkqnecinssnnqv 51 vpcepiiier niteivhlns ttlekeicpk vadyrnwskp qcqitgfapf 101skdnsirlsa ggdiwvtrep yvscglgkcy qfalgqgttl enkhsngtah 151 drtphrtllmnelgvpfhla tkqvciawss sschdgkawl hvcvtgddrn 201 atasiiydgi lvdsigswsknilrtqesec vcingtcavv mtdgsasgra 251 dtrilfireg kiahisplsg saqhveecscyprypevrcv crdnwkgsnr 301 pvlyinmany sidssyvcsg lvgdtprndd rssssncrdpnnergapgvk 351 gwafdngndi wmgrtikkds rsgyetfrvi ggwttansks qinrqvivds401 dnssgysgif svegkscinr cfyvelirgr pketrvwwts nsiivfcgts 451gtygtgswpd ganinfmpiA/Netherlands/219/03

Nucleotide sequence of A/Netherlands/219/03 H7 (SEQ ID NO: 21) Entiremolecule length: 1737 bp 1 agcaaaagca ggggatacaa aatgaacact caaatcctggtattcgctct 51 ggtggcgagc attccgacaa atgcagacaa gatctgcctt gggcatcatg 101ccgtgtcaaa cgggactaaa gtaaacacat taactgagag aggagtggaa 151 gtcgttaatgcaactgaaac ggtggaacga acaaacgttc ccaggatctg 201 ctcaaaaggg aaaaggacagttgacctcgg tcaatgtgga cttctgggaa 251 caatcactgg gccaccccaa tgtgaccaattcctagaatt ttcggccgac 301 ttaattattg agaggcgaga aggaagtgat gtctgttatcctgggaaatt 351 cgtgaatgaa gaagctctga ggcaaattct cagagagtca ggcggaattg401 acaaggagac aatgggattc acctacagcg gaataagaac taatggaaca 451accagtgcat gtaggagatc aggatcttca ttctatgcag agatgaaatg 501 gctcctgtcaaacacagaca atgctgcttt cccgcaaatg actaagtcat 551 acaagaacac aaggaaagacccagctctga taatatgggg gatccaccat 601 tccggatcaa ctacagaaca gaccaagctatatgggagtg gaaacaaact 651 gataacagtt gggagttcta attaccaaca gtcctttgtaccgagtccag 701 gagcgagacc acaagtgaat ggccaatctg gaagaattga ctttcattgg751 ctgatactaa accctaatga cacggtcact ttcagtttca atggggcctt 801catagctcca gaccgtgcaa gctttctgag agggaagtcc atgggaattc 851 agagtgaagtacaggttgat gccaattgtg aaggagattg ctatcatagt 901 ggagggacaa taataagtaatttgcccttt cagaacataa atagcagggc 951 agtaggaaaa tgtccgagat atgttaagcaagagagtctg ctgttggcaa 1001 caggaatgaa gaatgttccc gaaatcccaa agaggaggaggagaggccta 1051 tttggtgcta tagcgggttt cattgaaaat ggatgggaag gtttgattga1101 tgggtggtat ggcttcaggc atcaaaatgc acaaggggag ggaactgctg 1151cagattacaa aagcacccaa tcagcaattg atcaaataac agggaaatta 1201 aatcggcttatagaaaaaac taaccaacag tttgagttaa tagacaacga 1251 attcactgag gttgaaaggcaaattggcaa tgtgataaac tggaccagag 1301 attccatgac agaagtgtgg tcctataacgctgaactctt agtagcaatg 1351 gagaatcagc acacaattga tctggccgac tcagaaatgaacaaactgta 1401 cgaacgagtg aagagacaac tgagagagaa tgccgaagaa gatggcactg1451 gttgcttcga aatatttcac aagtgtgatg acgactgcat ggccagtatt 1501agaaacaaca cctatgatca cagcaagtac agggaagaag caatacaaaa 1551 tagaatacagattgacccag tcaaactaag cagcggctac aaagatgtga 1601 tactttggtt tagcttcggggcatcatgtt tcatacttct ggccattgca 1651 atgggccttg tcttcatatg tgtgaagaatggaaacatgc ggtgcactat 1701 ttgtatataa gtttggaaaa acacccttgt ttctact

Amino acid sequence of A/Netherlands/219/03 H7 (SEQ ID NO: 27) Entiremolecule length: 562 aa 1 mntqilvfal vasiptnadk iclghhavsn gtkvntltergvevvnatet 51 vertnvpric skgkrtvdlg qcgllgtitg ppqcdqflef sadliierre 101gsdvcypgkf vneealrqil resggidket mgftysgirt ngttsacrrs 151 gssfyaemkwllsntdnaaf pqmtksyknt rkdpaliiwg ihhsgstteq 201 tklygsgnkl itvgssnyqqsfvpspgarp qvngqsgrid fhwlilnpnd 251 tvtfsfngaf iapdrasflr gksmgiqsevqvdancegdc yhsggtiisn 301 lpfqninsra vgkcpryvkq eslllatgmk nvpeipkrrrrglfgaiagf 351 iengweglid gwygfrhqna qgegtaadyk stqsaidqit gklnrliekt401 nqqfelidne fteverqign vinwtrdsmt evwsynaell vamenqhtid 451ladsemnkly ervkrqlren aeedgtgcfe ifhkcdddcm asirnntydh 501 skyreeaiqnriqidpvkls sgykdvilwf sfgascfill aiamglvfic 551 vkngnmrcti ci

Nucleotide Sequence of A/Netherlands/219/03 N7 (SEQ ID NO: 22) Entiremolecule length: 1465 nt 1 agcaaaagca gggtgatcga gaatgaatcc aaatcagaaactatttgcat 51 tatctggagt ggcaatagca cttagtgtac tgaacttatt gataggaatc 101tcaaacgtcg gattgaacgt atctctacat ctaaaggaaa aaggacccaa 151 acaggaggagaatttaacat gcacgaccat taatcaaaac aacactactg 201 tagtagaaaa cacatatgtaaataatacaa caataattac caagggaact 251 gatttgaaaa caccaagcta tctgctgttgaacaagagcc tgtgcaatgt 301 tgaagggtgg gtcgtgatag caaaagacaa tgcagtaagatttggggaaa 351 gtgaacaaat cattgttacc agggagccat atgtatcatg cgacccaaca401 ggatgcaaaa tgtatgcctt gcaccaaggg actaccatta ggaacaaaca 451ttcaaatgga acgattcatg acagaacagc tttcagaggt ctcatctcca 501 ctccattgggcactccacca accgtaagta acagtgactt tatgtgtgtt 551 ggatggtcaa gcacaacttgccatgatggg attgctagga tgactatctg 601 tatacaagga aataatgaca atgctacagcaacggtttat tacaacagaa 651 ggctgaccac taccattaag acctgggcca gaaacattctgaggactcaa 701 gaatcagaat gtgtgtgcca caatggcaca tgtgcagttg taatgaccga751 cggatcggct agtagtcaag cctatacaaa agtaatgtat ttccacaagg 801gattagtagt taaggaggag gagttaaggg gatcagccag acatattgag 851 gaatgctcctgttatggaca caatcaaaag gtgacctgtg tgtgcagaga 901 taactggcag ggagcaaacaggcctattat agaaattgat atgagcacat 951 tggagcacac aagtagatac gtgtgcactggaattctcac agacaccagc 1001 agacctgggg acaaatctag tggtgattgt tccaatccaataactgggag 1051 tcccggcgtt ccgggagtga agggattcgg gtttctaaat ggggataaca1101 catggcttgg taggaccatc agccccagat caagaagtgg attcgaaatg 1151ttgaaaatac ctaatgcagg tactgatccc aattctagaa tagcagaacg 1201 acaggaaattgtcgacaata acaattggtc aggctattcc ggaagcttta 1251 ttgactattg gaatgataacagtgaatgct acaatccatg cttttacgta 1301 gagttaatta gaggaagacc cgaagaggctaaatacgtat ggtgggcaag 1351 taacagtcta attgccctat gtggaagccc attcccagttgggtctggtt 1401 ccttccccga tggggcacaa atccaatact tttcgtaaaa tgcaaaaaaa1451 ctccttgttt ctact

Nucleotide Sequence of A/Netherlands/219/03 N7 (SEQ ID NO: 45) Entiremolecule length: 1464 nt 1 agcaaaagca gggtgatcga gaatgaatcc aaatcagaaactatttgcat tatctggagt 61 ggcaatagca cttagtgtac tgaacttatt gataggaatctcaaacgtcg gattgaacgt 121 atctctacat ctaaaggaaa aaggacccaa acaggaggagaatttaacat gcacgaccat 181 taatcaaaac aacactactg tagtagaaaa cacatatgtaaataatacaa caataattac 241 caagggaact gatttgaaaa caccaagcta tctgctgttgaacaagagcc tgtgcaatgt 301 tgaagggtgg gtcgtgatag caaaagacaa tgcagtaagatttggggaaa gtgaacaaat 361 cattgttacc agggagccat atgtatcatg cgacccaacaggatgcaaaa tgtatgcctt 421 gcaccaaggg actaccatta ggaacaaaca ttcaaatggaacgattcatg acagaacagc 481 tttcagaggt ctcatctcca ctccattggg cactccaccaaccgtaagta acagtgactt 541 tatgtgtgtt ggatggtcaa gcacaacttg ccatgatgggattgctagga tgactatctg 601 tatacaagga aataatgaca atgctacagc aacggtttattacaacagaa ggctgaccac 661 taccattaag acctgggcca gaaacattct gaggactcaagaatcagaat gtgtgtgcca 721 caatggcaca tgtgcagttg taatgaccga cggatcggctagtagtcaag cctatacaaa 781 agtaatgtat ttccacaagg gattagtagt taaggaggaggagttaaggg gatcagccag 841 acatattgag gaatgctcct gttatggaca caatcaaaaggtgacctgtg tgtgcagaga 901 taactggcag ggagcaaaca ggcctattat agaaattgatatgagcacat tggagcacac 961 aagtagatac gtgtgcactg gaattctcac agacaccagcagacctgggg acaaatctag 1021 tggtgattgt tccaatccaa taactgggag tcccggcgttccgggagtga agggattcgg 1081 gtttctaaat ggggataaca catggcttgg taggaccatcagccccagat caagaagtgg 1141 attcgaaatg ttgaaaatac ctaatgcagg tactgatcccaattctagaa tagcagaacg 1201 acaggaaatt gtcgacaata acaattggtc aggctattccggaagcttta ttgactattg 1261 gaatgataac agtgaatgct acaatccatg cttttacgtagagttaatta gaggaagacc 1321 cgaagaggct aaatacgtat ggtgggcaag taacagtctaattgccctat gtggaagccc 1381 attcccagtt gggtctggtt ccttccccga tggggcacaaatccaatact tttcgtaaaa 1441 tgcaaaaaca cccttgtttc tact

Amino acid sequence of A/Netherlands/219/03 N7 (SEQ ID NO: 28) Entiremolecule length: 471 aa 1 mnpnqklfal sgvaialsvl nlligisnvg lnvslhlkekgpkqeenltc 51 ttinqnnttv ventyvnntt iitkgtdlkt psylllnksl cnvegwvvia 101kdnavrfges eqiivtrepy vscdptgckm yalhqgttir nkhsngtihd 151 rtafrglistplgtpptvsn sdfmcvgwss ttchdgiarm ticiqgnndn 201 atatvyynrr ltttiktwarnilrtqesec vchngtcavv mtdgsassqa 251 ytkvmyfhkg lvvkeeelrg sarhieecscyghnqkvtcv crdnwqganr 301 piieidmstl ehtsryvctg iltdtsrpgd kssgdcsnpitgspgvpgvk 351 gfgflngdnt wlgrtisprs rsgfemlkip nagtdpnsri aerqeivdnn401 nwsgysgsfi dywndnsecy npcfyvelir grpeeakyvw wasnslialc 451gspfpvgsgs fpdgaqiqyf sA/ck/BC/CN-7/04

Nucleotide sequence of A/ck/BC/CN-7/04 H7 (SEQ ID NO: 23) Entiremolecule length: 1754 bp 1 agcaaaagca ggggatacaa aatgaatact caaattttggcattcattgc 51 ttgtatgctg attggaacta aaggagacaa aatatgtctt gggcaccatg 101ctgtggcaaa tgggacaaaa gtgaacacac taacagagag gggaattgaa 151 gtagtcaatgccacggagac ggtggaaact gtaaatatta aaaaaatatg 201 cactcaagga aaaaggccaacagatctggg acaatgtgga cttctaggaa 251 ccctaatagg acctccccaa tgcgatcaatttctggagtt tgacgctaat 301 ttgataattg aacgaagaga aggaaccgat gtgtgctatcccgggaagtt 351 cacaaatgaa gaatcactga ggcagatcct tcgagggtca ggaggaattg401 ataaagagtc aatgggtttc acctatagtg gaataagaac caatggggcg 451acgagtgcct gcagaagatc aggttcttct ttctatgcgg agatgaagtg 501 gttactgtcgaattcagaca atgcggcatt tccccaaatg actaagtcgt 551 ataggaatcc caggaacaaaccagctctga taatctgggg agtgcatcac 601 tctggatcag ctactgagca gaccaaactctatggaagtg gaaacaagtt 651 gataacagta ggaagctcga aataccagca atcattcactccaagtccgg 701 gagcacggcc acaagtgaat ggacaatcag gaaggattga ttttcattgg751 ctactccttg accccaatga cacagtgacc ttcactttca atggggcatt 801catagcccct gacagggcaa gtttctttag aggagaatcg ctaggagtcc 851 agagtgatgttcctttggat tctggttgtg aaggggattg cttccacagt 901 gggggtacga tagtcagttccctgccattc caaaacatca accctagaac 951 agtggggaaa tgccctcgat atgtcaaacagacaagcctc cttttggcta 1001 caggaatgag aaacgtccca gagaacccca agcaggcctaccggaaacgg 1051 atgaccagag gcctttttgg agcgattgct ggattcatag agaatggatg1101 ggaaggtctc atcgatggat ggtatggttt cagacatcaa aatgcacaag 1151gagaaggaac tgcagctgac tacaaaagca cccaatctgc aatagatcag 1201 atcacaggcaaattgaatcg tctgattgac aaaacaaacc agcagtttga 1251 actgatagac aatgaattcagtgagataga acaacaaatc gggaatgtca 1301 ttaactggac acgagactca atgactgaggtatggtcgta taatgctgag 1351 ctgttggtgg caatggagaa tcagcataca atagatcttgcagactcaga 1401 aatgaacaaa ctttacgaac gcgtcagaaa acaactaagg gaaaatgctg1451 aagaagatgg aactggatgc tttgagatat tccataagtg tgatgatcag 1501tgtatggaga gcataaggaa caacacttat gaccataccc aatacaggac 1551 agagtcattgcagaatagaa tacagataga cccagtgaaa ttgagtagtg 1601 gatacaaaga cataatcttatggtttagct tcggggcatc atgttttctt 1651 cttctagcca ttgcaatggg attggttttcatttgcataa agaatggaaa 1701 catgcggtgc actatttgta tatagtttga gaaaaaaacacccttgtttc 1751 tact

Amino acid sequence of A/ck/BC/CN-7/04 H7 (SEQ ID NO: 29) Entiremolecule length: 567 aa 1 mntqilafia cmligtkgdk iclghhavan gtkvntltergievvnatet 51 vetvnikkic tqgkrptdlg qcgllgtlig ppqcdqflef danliierre 101gtdvcypgkf tneeslrqil rgsggidkes mgftysgirt ngatsacrrs 151 gssfyaemkwllsnsdnaaf pqmtksyrnp rnkpaliiwg vhhsgsateq 201 tklygsgnkl itvgsskyqqsftpspgarp qvngqsgrid fhwllldpnd 251 tvtftfngaf iapdrasffr geslgvqsdvpldsgcegdc fhsggtivss 301 lpfqninprt vgkcpryvkq tslllatgmr nvpenpkqayrkrmtrglfg 351 aiagfiengw eglidgwygf rhqnaqgegt aadykstqsa idqitgklnr401 lidktnqqfe lidnefseie qqignvinwt rdsmtevwsy naellvamen 451qhtidladse mnklyervrk qlrenaeedg tgcfeifhkc ddqcmesirn 501 ntydhtqyrteslqnriqid pvklssgykd iilwfsfgas cflllaiamg 551 lvficikngn mrctici

Nucleotide Sequence of A/ck/BC/CN-7/04 N3 (SEQ ID NO: 24) Entiremolecule length: 1453 nt 1 agcaaaagca ggtgcgagat gaatccgaat cagaagataataacaatcgg 51 ggtagtgaat accactctgt caacaatagc ccttctcatt ggagtgggaa 101acttagtttt caacacagtc atacatgaga aaataggaga ccatcaaata 151 gtgacccatccaacaataat gacccctgaa gtaccgaact gcagtgacac 201 tataataaca tacaataacactgttataaa caacataaca acaacaataa 251 taactgaagc agaaaggcct ttcaagtctccactaccgct gtgccccttc 301 agaggattct tcccttttca caaggacaat gcaatacgactgggtgaaaa 351 caaagacgtc atagtcacaa gggagcctta tgttagctgc gataatgaca401 actgctggtc ctttgctctc gcacaaggag cattgctagg gactaaacat 451agcaatggga ccattaaaga cagaacacca tataggtctc taattcgttt 501 cccaataggaacagctccag tactaggaaa ttacaaagag atatgcattg 551 cttggtcgag cagcagttgctttgacggga aagagtggat gcatgtgtgc 601 atgacaggga atgataatga tgcaagtgcccagataatat atggaggaag 651 aatgacagac tccattaaat catggaggaa ggacatactaagaacccagg 701 agtctgaatg tcaatgcatt gacgggactt gtgttgttgc tgtcacagat751 ggccctgctg ctaatagtgc agatcacagg gtttactgga tacgggaggg 801aagaataata aagtatgaaa atgttcccaa aacaaagata caacacttag 851 aagaatgttcctgctatgtg gacattgatg tttactgtat atgtagggac 901 aattggaagg gctctaacagaccttggatg agaatcaaca acgagactat 951 actggaaaca ggatatgtat gtagtaaatttcactcagac acccccaggc 1001 cagctgaccc ttcaataatg tcatgtgact ccccaagcaatgtcaatgga 1051 ggacccggag tgaaggggtt tggtttcaaa gctggcaatg atgtatggtt1101 aggtagaaca gtgtcaacta gtggtagatc gggctttgaa attatcaaag 1151ttacagaagg gtggatcaac tctcctaacc atgtcaaatc aattacacaa 1201 acactagtgtccaacaatga ctggtcaggc tattcaggta gcttcattgt 1251 caaagccaag gactgttttcagccctgttt ttatgttgag cttatacgag 1301 ggaggcccaa caagaatgat gacgtctcttggacaagtaa tagtatagtt 1351 actttctgtg gactagacaa tgaacctgga tcgggaaattggccagatgg 1401 ttctaacatt gggtttatgc ccaagtaata gaaaaaagca ccttgtttct1451 act

Amino acid sequence of A/ck/BC/CN-7/04 N3 (SEQ ID NO: 30) Entiremolecule length: 469 aa 1 mnpnqkiiti gvvnttlsti alligvgnlv fntvihekigdhqivthpti 51 mtpevpncsd tiitynntvi nnitttiite aerpfksplp lcpfrgffpf 101hkdnairlge nkdvivtrep yvscdndncw sfalaqgall gtkhsngtik 151 drtpyrslirfpigtapvlg nykeiciaws ssscfdgkew mhvcmtgndn 201 dasaqiiygg rmtdsikswrkdilrtqese cqcidgtcvv avtdgpaans 251 adhrvywire griikyenvp ktkiqhleecscyvdidvyc icrdnwkgsn 301 rpwmrinnet iletgyvcsk fhsdtprpad psimscdspsnvnggpgvkg 351 fgfkagndvw lgrtvstsgr sgfeiikvte gwinspnhvk sitqtlvsnn401 dwsgysgsfi vkakdcfqpc fyvelirgrp nknddvswts nsivtfcgld 451nepgsgnwpd gsnigfmpkca A/ck/BC/CN-6/04

Nucleotide sequence of ca A/ck/BC/CN-6/04 H7 (SEQ ID NO: 25) Entiremolecule length: 1734 bp 1 agcgaaagca ggggatacaa aatgaatact caaattttggcattcattgc 51 ttgtatgctg attggaacta aaggagacaa aatatgtctt gggcaccatg 101ctgtggcaaa tgggacaaaa gtgaacacac taacagagag gggaattgaa 151 gtagtcaatgccacggagac ggtggaaact gtaaatatta agaaaatatg 201 cactcaagga aaaaggccaacagatctggg acaatgtgga cttctaggaa 251 ccctaatagg acctccccaa tgcgatcaatttctggagtt tgacgctaat 301 ttgataattg aacgaagaga aggaaccgat gtgtgctatcccgggaagtt 351 cacaaatgaa gaatcactga ggcagatcct tcgagggtca ggaggaattg401 ataaagagtc aatgggtttc acctatagtg gaataagaac caatggggcg 451acgagtgcct gcagaagatc aggttcttct ttctatgcgg agatgaagtg 501 gttactgtcgaattcagaca atgcggcatt tccccaaatg actaagtcgt 551 ataggaatcc caggaacaaaccagctctga taatctgggg agtgcatcac 601 tctggatcag ctactgagca gaccaaactctatggaagtg gaaacaagtt 651 gataacagta ggaagctcga aataccagca atcattcactccaagtccgg 701 gagcacggcc acaagtgaat ggacaatcag gaaggattga ttttcattgg751 ctactccttg accccaatga cacagtgacc ttcactttca atggggcatt 801catagcccct gacagggcaa gtttctttag aggagaatcg ctaggagtcc 851 agagtgatgttcctttggat tctggttgtg aaggggattg cttccacagt 901 gggggtacga tagtcagttccctgccattc caaaacatca accctagaac 951 agtggggaaa tgccctcgat atgtcaaacagacaagcctc cttttggcta 1001 caggaatgag aaacgtccca gagaacccca agaccagaggcctttttgga 1051 gcgattgctg gattcataga gaatggatgg gaaggtctca tcgatggatg1101 gtatggtttc agacatcaaa atgcacaagg agaaggaact gcagctgact 1151acaaaagcac ccaatctgca atagatcaga tcacaggcaa attgaatcgt 1201 ctgattgacaaaacaaacca gcagtttgaa ctgatagaca atgaattcag 1251 tgagatagaa caacaaatcgggaatgtcat taactggaca cgagactcaa 1301 tgactgaggt atggtcgtat aatgctgagctgttggtggc aatggagaat 1351 cagcatacaa tagatcttgc agactcagaa atgaacaaactttacgaacg 1401 cgtcagaaaa caactaaggg aaaatgctga agaagatgga actggatgct1451 ttgagatatt ccataagtgt gatgatcagt gtatggagag cataaggaac 1501aacacttatg accataccca atacaggaca gagtcattgc agaatagaat 1551 acagatagacccagtgaaat tgagtagtgg atacaaagac ataatcttat 1601 ggtttagctt cggggcatcatgttttcttc ttctagccat tgcaatggga 1651 ttggttttca tttgcataaa gaatggaaacatgcggtgca ctatttgtat 1701 atagtttgag aaaaaaacac ccttgtttct act

Amino acid sequence of ca A/ck/BC/CN-6/04 H7 (SEQ ID NO: 31) Entiremolecule length: 560 aa 1 mntqilafia cmligtkgdk iclghhavan gtkvntltergievvnatet 51 vetvnikkic tqgkrptdlg qcgllgtlig ppqcdqflef danliierre 101gtdvcypgkf tneeslrqil rgsggidkes mgftysgirt ngatsacrrs 151 gssfyaemkwllsnsdnaaf pqmtksyrnp rnkpaliiwg vhhsgsateq 201 tklygsgnkl itvgsskyqqsftpspgarp qvngqsgrid fhwllldpnd 251 tvtftfngaf iapdrasffr geslgvqsdvpldsgcegdc fhsggtivss 301 lpfqninprt vgkcpryvkq tslllatgmr nvpenpktrglfgaiagfie 351 ngweglidgw ygfrhqnaqg egtaadykst qsaidqitgk lnrlidktnq401 qfelidnefs eieqqignvi nwtrdsmtev wsynaellva menqhtidla 451dsemnklyer vrkqlrenae edgtgcfeif hkcddqcmes irnntydhtq 501 yrteslqnriqidpvklssg ykdiilwfsf gascflllai amglvficik 551 ngnmrctici

Nucleotide sequence of ca A/ck/BC/CN-6/04 N3 (SEQ ID NO: 26) Entiremolecule length: 1453 nt 1 agcaaaagca ggtgcgagat gaatccgaat cagaagataataacaatcgg 51 ggtagtgaat accactctgt caacaatagc ccttctcatt ggagtgggaa 101acttagtttt caacacagtc atacatgaga aaataggaga ccatcaaata 151 gtgacccatccaacaataat gacccctgaa gtaccgaact gcagtgacac 201 tataataaca tacaataacactgttataaa caacataaca acaacaataa 251 taactgaagc agaaaggcct ttcaagtctccactaccgct gtgccccttc 301 agaggattct tcccttttca caaggacaat gcaatacgactgggtgaaaa 351 caaagacgtc atagtcacaa gggagcctta tgttagctgc gataatgaca401 actgctggtc ctttgctctc gcacaaggag cattgctagg gactaaacat 451agcaatggga ccattaaaga cagaacacca tataggtctc taattcgttt 501 cccaataggaacagctccag tactaggaaa ttacaaagag atatgcattg 551 cttggtcgag cagcagttgctttgacggga aagagtggat gcatgtgtgc 601 atgacaggga atgataatga tgcaagtgcccagataatat atggaggaag 651 aatgacagac tccattaaat catggaggaa ggacatactaagaacccagg 701 agtctgaatg tcaatgcatt gacgggactt gtgttgttgc tgtcacagat751 ggccctgctg ctaatagtgc agatcacagg gtttactgga tacgggaggg 801aagaataata aagtatgaaa atgttcccaa aacaaagata caacacttag 851 aagaatgttcctgctatgtg gacattgatg tttactgtat atgtagggac 901 aattggaagg gctctaacagaccttggatg agaatcaaca acgagactat 951 actggaaaca ggatatgtat gtagtaaatttcactcagac acccccaggc 1001 cagctgaccc ttcaataatg tcatgtgact ccccaagcaatgtcaatgga 1051 ggacccggag tgaaggggtt tggtttcaaa gctggcaatg atgtatggtt1101 aggtagaaca gtgtcaacta gtggtagatc gggctttgaa attatcaaag 1151ttacagaagg gtggatcaac tctcctaacc atgtcaaatc aattacacaa 1201 acactagtgtccaacaatga ctggtcaggc tattcaggta gcttcattgt 1251 caaagccaag gactgttttcagccctgttt ttatgttgag cttatacgag 1301 ggaggcccaa caagaatgat gacgtctcttggacaagtaa tagtatagtt 1351 actttctgtg gactagacaa tgaacctgga tcgggaaattggccagatgg 1401 ttctaacatt gggtttatgc ccaagtaata gaaaaaagca ccttgtttct1451 act

Amino acid sequence of ca A/ck/BC/CN-6/04 N3 (SEQ ID NO: 32) Entiremolecule length: 469 aa 1 mnpnqkiiti gvvnttlsti alligvgnlv fntvihekigdhqivthpti 51 mtpevpncsd tiitynntvi nnitttiite aerpfksplp lcpfrgffpf 101hkdnairlge nkdvivtrep yvscdndncw sfalaqgall gtkhsngtik 151 drtpyrslirfpigtapvlg nykeiciaws ssscfdgkew mhvcmtgndn 201 dasaqiiygg rmtdsikswrkdilrtqese cqcidgtcvv avtdgpaans 251 adhrvywire griikyenvp ktkiqhleecscyvdidvyc icrdnwkgsn 301 rpwmrinnet iletgyvcsk fhsdtprpad psimscdspsnvnggpgvkg 351 fgfkagndvw lgrtvstsgr sgfeiikvte gwinspnhvk sitqtlvsnn401 dwsgysgsfi vkakdcfqpc fyvelirgrp nknddvswts nsivtfcgld 451nepgsgnwpd gsnigfmpkca A/Duck

Nucleotide sequence of ca A/Duck H6 (SEQ ID NO: 33) Entire moleculelength: 1743 bp 1 agcaaaagca ggggaaaatg attgcagtca ttataatagc ggtactggcaacggccggaa 61 aatcagacaa gatctgcatt gggtatcatg ccaacaattc aacaacacaagtggatacga 121 tacttgagaa gaatgtaacc gtcacacact cagttgaatt gctggagaaccaaaaagaag 181 aaagattctg caagatcttg aacaaggccc ctctcgattt aagaggatgtaccatagagg 241 gttggatctt ggggaatccc caatgcgacc tattgcttgg tgatcaaagctggtcatata 301 tagtggaaag acctacagct caaaatggga tctgctaccc aggaattttgaatgaagtag 361 aagaactgaa ggcacttatt ggatcaggag aaagagtgga gagatttgaaatgtttccca 421 aaagtacatg ggcaggagta gacaccagca gtggggtaac aaaggcttgcccttatacta 481 gtggttcgtc tttctacaga aacctcctat ggataataaa aaccaagtccgcagcatatc 541 cagtaattaa gggaacctac aataacactg gaagccagcc aatcctctatttctggggtg 601 tgcaccatcc tcctgacacc aatgagcaaa acactttgta tggctctggtgatcgatatg 661 tcaggatggg aactgaaagc atgaattttg ccaagagccc agaaattgcggcaaggcctg 721 ctgtgaatgg tcaaagaggc agaattgatt attactggtc tgttttaaagccgggggaaa 781 ccttgaatgt ggaatctaat ggaaatctaa tcgccccttg gtatgcatacaaatttgtca 841 gcaccaatag taaaggagcc gtcttcaagt caaatttacc aatcgagaactgtgatgcca 901 catgccagac tattgcagga gtcttaagaa ccaataaaac atttcagaatgtaagccctc 961 tgtggatagg agaatgcccc aaatatgtga aaagtgaaag tttgaggcttgcaactggac 1021 taagaaatat tccacagatt gagactagag gacttttcgg agctatcgcagggtttattg 1081 aaggaggatg gactggaatg atagatgggt ggtatggcta tcaccatgaaaattctcaag 1141 gctcagggta tgcggcagac agagaaagca ctcaaagggc tatagacggaattacaaata 1201 aggtcaattc cattatagac aaaatgaaca cacaattcga agctatagaccacgaattct 1261 caaatttgga gagaagaatt gacagtctga acaaaagaat ggaagatggatttctggacg 1321 tttggacata caatgctgaa ctgttggttc ttcttgaaaa cgaaaggacactagacctac 1381 atgacgcgaa tgtgaagaac ctgtatgaaa aggtcaaatc acaactacgggacaatgcta 1441 atgatctagg aaatggatgc tttgaatttt ggcataagtg tgacaatgaatgcatagagt 1501 ctgtcaaaaa tggtacctat gactatccca aatatcagga tgaaagcaaattgaacaggc 1561 aggaaataga atcggtgaag ctggagaacc ttggtgtgta tcaaatcctcgccatttata 1621 gtacggtatc gagcagtcta gtcttggtag ggctgattat agcaatgggtctttggatgt 1681 gttcaaatgg ttcaatgcaa tgcaggatat gtatataatt aagaaaaacacccttgttct 1741 act

Amino acid sequence of ca A/Duck H6 (SEQ ID NO: 39) Entire moleculelength: 566 aa 1 miaviiiavl atagksdkic igyhannstt qvdtileknv tvthsvellenqkeerfcki 61 lnkapldlrg ctiegwilgn pqcdlllgdq swsyiverpt aqngicypgilneveelkal 121 igsgerverf emfpkstwag vdtssgvtka cpytsgssfy rnllwiiktksaaypvikgt 181 ynntgsqpil yfwgvhhppd tneqntlygs gdryvrmgte smnfakspeiaarpavngqr 241 gridyywsvl kpgetlnves ngnliapwya ykfvstnskg avfksnlpiencdatcqtia 301 gvlrtnktfq nvsplwigec pkyvkseslr latglrnipq ietrglfgaiagfieggwtg 361 midgwygyhh ensqgsgyaa drestqraid gitnkvnsii dkmntqfeaidhefsnlerr 421 idslnkrmed gfldvwtyna ellvllener tldlhdanvk nlyekvksqlrdnandlgng 481 cfefwhkcdn eciesvkngt ydypkyqdes klnrqeiesv klenlgvyqilaiystvsss 541 lvlvgliiam glwmcsngsm qcrici

Nucleotide sequence of ca A/Duck N9 (SEQ ID NO: 34) Entire moleculelength: 1460 nt 1 agcaaaagca gggtcaagat gaatccaaat cagaagattc tatgcacatctgctactgcc 61 attgcaatag gcacaattgc tgtattaata ggaatagcaa acctgggtttgaacatagga 121 ctacacctga aaccgagctg caactgctcc aaccctcctc ctgaaacaacaaatgtaagc 181 caaacaataa taaacaatta ctacaatgaa acaaatgtta cccaaataagtaacacaaac 241 attcaacata tggggggaac cgaaaaggac ttcaacaatc tgactaaagggctctgcaca 301 ataaattcat ggcatatatt cggaaaggac aatgctataa gaataggggagaactctgat 361 gttttagtca caagagagcc atatgtttct tgtgatccag atgaatgcagattctatgct 421 ctcagccaag gaacaacaat acggggaaag cactcaaatg gaacaatacacgatagatcc 481 caataccgtg ctttagtgag ctggccttta tcatcaccac ccactgtgtacaataccaga 541 gtagaatgca ttggatggtc cagtacaagc tgccatgatg ggaaagcacgaatgtctata 601 tgtgtctcag gtcccaacaa caatgcatca gcagtgattt ggtacaaagggcggcctatc 661 acggaaatca atacgtgggc ccgaaacata ttgagaaccc aagaatctgagtgtgtatgc 721 cacaatggaa tatgtccagt agtgttcact gacggttctg ccaccggtccagcagaaact 781 aggatatact atttcaaaga ggggaaaatc ctcaaatggg agccactaactggaaccgcc 841 aagcacattg aagaatgctc ttgctatggg aaagactcag aaataacgtgcacatgtaga 901 gacaattggc aaggctcgaa tagaccagta atacaaataa accccacaatgatgactcac 961 actagtcaat acatatgcag ccctgtcctc acagacaatc cacgccccaatgaccccacg 1021 gtaggcaagt gtaatgatcc ttatccagga aacaacaata atggagtcaaaggattctca 1081 tatttagatg gtgacaatac atggctagga agaacgataa gcacagcctctaggtctggg 1141 tatgaaatgc tgaaagtgcc taatgcattg acagatgata gatcaaaacctactcaaggt 1201 cagacaattg tattaaacac agactggagt ggttacagtg ggtctttcattgattactgg 1261 gcaaaagggg agtgctatag agcatgcttc tacgttgagc tgatccgtggaaggccaaaa 1321 gaggacaaag tgtggtggac cagtaatagt atagtgtcga tgtgttccagcacagagttc 1381 cttggacaat ggaactggcc agatggggct aaaatagagt acttcctctaagatgtagaa 1441 aaaagaccct tgtttctact

Amino acid sequence of ca A/Duck N9 (SEQ ID NO: 40) Entire moleculelength: 470 aa 1 mnpnqkilct sataiaigti avilgianlg lniglhlkps cncsnpppettnvsqtiinn 61 yynetnvtqi sntniqhmgg tekdfnnltk glctinswhi fgkdnairigensdvlvtre 121 pyvscdpdec rfyalsqgtt irgkhsngti hdrsqyralv swplsspptvyntrvecigw 181 sstschdgka rmsicvsgpn nnasaviwyk grpiteintw arnilrtqesecvchngicp 241 vvftdgsatg paetriyyfk egkilkwepl tgtakhieec scygkdseitctcrdnwqgs 301 nrpviqinpt mmthtsqyic spvltdnprp ndptvgkcnd pypgnnnngvkgfsyldgdn 361 twlgrtista srsgyemlkv pnaltddrsk ptqgqtivln tdwsgysgsfidywakgecy 421 racfyvelir grpkedkvww tsnsivsmcs steflgqwnw pdgakieyflca A/Teal

Nucleotide sequence of ca A/Teal H6 (SEQ ID NO: 35) Entire moleculelength: 1747 bp 1 agcaaaagca ggggaaaatg attgcaatca ttgtaatagc aatactggcagcagccggaa 61 aatcagacaa gatctgcatt gggtatcatg ccaacaattc aacaacacaggtagatacga 121 tacttgagaa gaatgtgact gtcacacact caattgaatt gctggaaaatcagaaggaag 181 aaagattctg caagatattg aacaaggccc ctctcgactt aagggaatgtaccatagagg 241 gttggatctt ggggaatccc caatgcgacc tattgcttgg tgatcaaagctggtcataca 301 ttgtggaaag acctactgct caaaacggga tctgctaccc aggaaccttaaatgaggtag 361 aagaactgag ggcacttatt ggatcaggag aaagggtaga gagatttgagatgtttcccc 421 aaagcacctg gcaaggagtt gacaccaaca gtggaacaac aagatcctgcccttattcta 481 ctggtgatcc gtctttctac agaaacctcc tatggataat aaaaaccaagacagcagaat 541 atccagtaat taagggaatt tacaacaaca ctggaaccca gccaatcctctatttctggg 601 gtgtgcatca tcctcctaac accgacgagc aagatactct gtatggctctggtgatcgat 661 acgttagaat gggaactgaa agcatgaatt ttgccaagag tccggaaattgcggcaaggc 721 ctgctgtgaa tggacaaaga ggcagaattg attattattg gtcggttttaaaaccagggg 781 aaaccttgaa tgtggaatct aatggaaatc taatcgcccc ttggtatgcatacaaatttg 841 tcaacacaaa tagtaaagga gccgtcttca ggtcagattt accaatcgagaactgcgatg 901 ccacatgcca gactattgca ggggttctaa ggaccaataa aacatttcagaatgtgagtc 961 ccctgtggat aggagaatgt cccaaatacg tgaaaagtga aagtctgaggcttgcaactg 1021 gactaagaaa tgttccacag attgaaacta gaggactctt cggagctattgcagggttta 1081 ttgaaggagg atggactggg atgatagatg ggtggtatgg ctatcaccatgaaaattctc 1141 aagggtcagg atatgcagcg gacagagaaa gcactcaaaa ggctgtaaacagaattacaa 1201 ataaggtcaa ttccatcatc aacaaaatga acacacaatt tgaagctgtcgatcacgaat 1261 tttcaaatct ggagaggaga atcgacaatc tgaacaaaag aatgcaagatggatttctgg 1321 atgtttggac atacaatgct gaactgttgg ttcttcttga aaacgaaagaacactagaca 1381 tgcatgacgc aaatgtgaag aacctacatg aaaaggtcaa atcacaactaagggacaatg 1441 ctaacgatct agggaatggt tgctttgaat tttggcataa gtgtgacaatgaatgcatag 1501 agtctgtcaa aaatggtaca tatgactatc ccaaatacca gactgaaagcaaattaaaca 1561 ggctaaaaat agaatcagta aagctagaga accttggtgt gtatcaaattcttgccattt 1621 atagtacggt atcgagcagc ctagtgttgg tagggctgat catggcaatgggtctttgga 1681 tgtgttcaaa tggttcaatg cagtgcaatg tgtgtatatg attaagaaaaacacccttgt 1741 ttctact

Amino acid sequence of ca A/Teal H6 (SEQ ID NO: 41) Entire moleculelength: 567 aa 1 miaiiviail aaagksdkic igyhannstt qvdtileknv tvthsiellenqkeerfcki 61 inkapldlre ctiegwilgn pqcdlllgdq swsyiverpt aqngicypgtlneveelral 121 igsgerverf emfpqstwqg vdtnsgttrs cpystgdpsf yrnllwiiktktaeypvikg 181 iynntgtqpi lyfwgvhhpp ntdeqdtlyg sgdryvrmgt esmnfakspeiaarpavngq 241 rgridyywsv lkpgetlnve sngnliapwy aykfvntnsk gavfrsdlpiencdatcqti 301 agvlrtnktf qnvsplwige cpkyvksesl rlatglrnvp qietrglfgaiagfieggwt 361 gmidgwygyh hensqgsgya adrestqkav nritnkvnsi inkmntqfeavdhefsnler 421 ridnlnkrmq dgfldvwtyn aellvllene rtldmhdanv knlhekvksqlrdnandlgn 481 gcfefwhkcd neciesvkng tydypkyqte sklnrlkies vklenlgvyqilaiystvss 541 slvlvglima mglwmcsngs mqcnvci

Nucleotide sequence of ca A/Teal N1 (SEQ ID NO: 36)Entire molecule length: 1401 nt 1agcaaaagca ggagtttaac atgaatccaa atcagaagat aataaccatt gggtcaatct 61gtatggtagt tggaataatc agcttgatgt tacaaattgg aaacataata tcaatatggg 121ttagccacat aattcagact gggcatccaa accagcctgg gccatgcaat caaagcatca 181atttttacac tgagcaggct gcagcttcag tgacattagc gggtaattcc tctctctgcc 241ctattagtgg atgggctata tacagtaaag acaatagtat aagaattggt tccaaagggg 301atgtgtttgt tatgagagaa ccattcgttt catgctccca tttggaatgc agaacctttt 361tcttgactca aggagcccta ttgaatgaca agcattctaa tgggaccgtt aaagacagaa 421gcccctatag aactttaatg agctgtcctg ttggtgaggc tccttcccca tacaactcaa 481ggtttgagtc tgttgcttgg tcagcaagtg cttgccatga tggcattagt tggctaacaa 541ttggaatttc cggtccggat aatggggctg tggctgtgtt gaaatacaat ggcataataa 601cagacaccat caagagttgg aggaacaaca tactgaggac acaagagtct gaatgtgcat 661gtgtgaatgg ttcttgtttt actgtaatga cagatggacc gagtaatgaa caggcctcat 721acaagatttt caagatagag aaggggaaag tagtcaaatc agttgagttg aacgccccta 781attatcatta cgaggaatgc tcctgttatc ctgatgctgg cgaaatcaca tgtgtgtgca 841gggataattg gcatggctcg aaccgaccgt gggtgtcttt caatcagaat ctggagtatc 901aaataggata tatatgcagt ggggttttcg gagacagtcc acgccccaat gatggaacag 961gcagttgcgg tccagtgtct cttaacggag agtatggagt aaaagggttt tcatttaagt 1021acggtgatgg tgtttggatc gggagaacca aaagcactag ttccaggagc gggtttgaaa 1081tgatttggga tccaaatggg tggaccgaaa cagatagtaa cttctcattg aagcaagaca 1141tcatagcaat aactgattgg tcaggataca gcgggagttt tgtccaacat ccagaactga 1201caggattaaa ttgcatgagg ccttgcttct gggttgaact aatcagaggg aggcccaaag 1261agaaaacaat ctggactagt gggagcagta tatctttctg tggtgtaaat agtgacactg 1321tgggttggtc ttggccagac ggtgctgagg tgccattcac cattgacaag tagtttgttc 1381aaaaaactcc ttgtttctac t

Amino acid sequence of ca A/Teal N1 (SEQ ID NO: 42)Entire molecule length: 450 aa 1mnpnqkiiti gsicmvvgii slmlqignii siwvshiiqt ghpnqpgpcn qsinfyteqa 61aasvtlagns slcpisgwai yskdnsirig skgdvfvmre pfvscshlec rtffltqgal 121lndkhsngtv kdrspyrtlm scpvgeapsp ynsrfesvaw sasachdgis wltigisgpd 181ngavavlkyn giitdtiksw rnnilrtqes ecacvngscf tvmtdgpsne qasykifkie 241kgkvvksvel napnyhyeec scypdageit cvcrdnwhgs nrpwvsfnqn leyqigyics 301gvfgdsprpn dgtgscgpvs lngeygvkgf sfkygdgvwi grtkstssrs gfemiwdpng 361wtetdsnfsl kqdiiaitdw sgysgsfvqh peltglncmr pcfwvelirg rpkektiwts 421gssisfcgvn sdtvgwswpd gaevpftidkca A/Mallard

Nucleotide sequence of ca A/Mallard H6 (SEQ ID NO: 37)Entire molecule length: 1745 bp 1agcaaaagca ggggaaaatg attgcaatca taatacttgc aatagtggtc tctaccagca 61agtcagacag gatctgcatt ggttaccatg caaacaactc gacaacacaa gtggacacaa 121tattagagaa gaatgtgaca gtgacacact cagtggagct cctagaaaac cagaaggaga 181atagattctg cagagtcttg aataaagcgc cactggatct aatggactgc accactgagg 241gttggatcct tggaaacccc cgatgtgata acttactcgg tgatcaaagt tggtcataca 301tagtagagag gcctgatgcc caaaatggga tatgttaccc aggggtattg aaggagacgg 361aagagctgaa agcactcatt gggtctatag atagcataca aagatttgaa atgtttccca 421agagcacgtg gaccggggta gatactaata gcggagttac gagcgcttgc ccctacaatg 481gtgaatcttc cttttacagg aatctgttgt ggataataaa aataagatct gatccgtact 541cattgatcaa ggggacatat accaatacag gctctcagcc aatcttatat ttctggggtg 601tgcaccatcc tccagatgaa gttgagcaag ctaacttgta tggaattggt acccggtatg 661ttaggatggg aactgaaagt atgaattttg ccaaaggtcc tgaaatagca ggcagaccac 721ctgcgaatgg gcaacgagga agaattgatt attattggtc tgtgttgaag ccaggagaaa 781ccttgaatgt ggaatccaat ggaaatttaa tagctccttg gtatgcttac aagttcacta 841gttccagaaa caagggagct attttcaaat cagaccttcc aattgagaat tgtgatgctg 901tctgtcaaac tttagctgga gcaataaata caaacaaaac cttccaaaat attagtccag 961tctggattgg agaatgcccc aaatatgtta aaagtaagag cctaaaacta gcaactggtc 1021tgagaaatgt tccacaggca gaaacaagag gattgtttgg agcaatagct gggtttatag 1081aaggaggatg gacaggtatg gtagacggat ggtacggata ccaccatgaa aattcacagg 1141ggtctggtta tgcagcagat aaagaaagca ctcagaaagc aatagacggg atcaccaata 1201aagtcaattc aatcattgac aaaatgaaca cacaatttga ggcagtagag catgagttct 1261caagtctcga aaggagaata ggcaatctga acaaaagaat ggaagatgga tttttagacg 1321tgtggacata caatgctgaa cttctggttc tactggaaaa tgagaggact ttggacatgc 1381atgatgctaa tgtaaagaat ctacatgaaa aggtgaaatc acaattaagg gataatgcaa 1441aggatttggg taatgggtgt tttgaatttt ggcacaaatg cgacaatgaa tgcatcaact 1501cagttaaaaa tggcacatat gactacccaa agtaccagga agagagcaga cttaataggc 1561aggaaataaa atcagtgatg ctggaaaatc tgggagtata ccaaatcctt gctatttata 1621gtacggtatc gagcagtctg gttttggtgg gactgatcat tgccatgggt ctttggatgt 1681gctcaaatgg ctcaatgcaa tgcaagatat gtatataatt agaaaaaaac acccttgttt 1741ctact

Amino acid sequence of ca A/Mallard H6 (SEQ ID NO: 43)Entire molecule length: 566 aa 1miaiiilaiv vstsksdric igyhannstt qvdtileknv tvthsvelle nqkenrfcrv 61lnkapldlmd cttegwilgn prcdnllgdq swsyiverpd aqngicypgv lketeelkal 121igsidsiqrf emfpkstwtg vdtnsgvtsa cpyngessfy rnllwiikir sdpyslikgt 181ytntgsqpil yfwgvhhppd eveqanlygi gtryvrmgte smnfakgpei agrppangqr 241gridyywsvl kpgetlnves ngnliapwya ykftssrnkg aifksdlpie ncdavcqtla 301gaintnktfq nispvwigec pkyvkskslk latglrnvpq aetrglfgai agfieggwtg 361mvdgwygyhh ensqgsgyaa dkestqkaid gitnkvnsii dkmntqfeav ehefsslerr 421ignlnkrmed gfldvwtyna ellvllener tldmhdanvk nlhekvksql rdnakdlgng 481cfefwhkcdn ecinsvkngt ydypkyqees rlnrqeiksv mlenlgvyqi laiystvsss 541lvlvgliiam glwmcsngsm qckici

Nucleotide sequence of ca A/Mallard N2 (SEQ ID NO: 38)Entire molecule length: 1467 nt 1agcaaaagca ggagtgaaaa tgaatccaaa tcagaggata ataacaattg gatccgtctc 61tctaactatt gcaacagtgt gtttcctcat gcagattgcc atcctagcaa cgactgtgac 121actgcatttc aaacaaaatg aatgcagcat tcccgcaaac aaccaagtaa cgccatgtga 181accaatagta atagagagga acataacaga gatagtgtat ttgaataata ctaccataga 241aaaagagatt tgtcctgaag tagtagaata caggaattgg tcaaaaccgc aatgtcaaat 301tacagggttt gctcctttct ccaaggacaa ctcaattcgg ctttctgctg gtggggacat 361ttggataaca agagaacctt atgtgtcatg cgaccccagt aaatgttatc aatttgcact 421cgggcagggg accacgctgg acaacaaaca ctcaaatggc acaatacatg atagaatccc 481tcatcggacc cttttgatga atgaattggg tgttccgttt catttgggaa ccaaacaagt 541gtgcatagca tggtccagct caagctgtca tgatgggaaa gcatggttgc acgtttgtgt 601cactggggat gatagaaatg caactgctag tttcatttat gatgggatgc ttattgacag 661tattggttcc tggtctcaaa atatcctcag gactcaggag tcagaatgcg tttgtatcag 721tggaacttgt acagtagtaa tgactgatgg aagtgcatca ggaagggcag acactagaat 781actattcatt agagagggga aaattgtcca cattagtcca ttgtcaggaa gtgctcagca 841tgtagaggaa tgttcttgtt atccccggta cccaaacgtc agatgtgtct gcagagacaa 901ctggaagggc tctaataggc ccgttataga tataaatatg gcagattata gcattgactc 961aagttatgtg tgctcaggac ttgttggaga cacaccaagg aacgatgata gctctagcag 1021cagcaactgc agggatccta ataatgagag agggaaccca ggagtgaaag ggtgggcctt 1081tgataatgga aatgatgtgt ggatgggaag aacaatcagt aaagattcgc gctcaggcta 1141tgagaccttc aaggtcattg gtggttgggc cattgctaat tccaagtcac agaccaatag 1201acaagtcata gttgataata acaactggtc tggttattct ggtattttct ctgttgaaag 1261caaaggctgc atcaataggt gtttttatgt ggagttgata agaggaaggc cacaggagac 1321tagagtatgg tggacctcaa acagtattgt cgtattttgt ggcacttcag ggacatatgg 1381aacaggctca tggcctgatg gggcgaatat cgatttcatg cctatataag ctttcgcaat 1441tttagaaaaa aactccttgt ttctact

Amino acid sequence of ca A/Mallard N2 (SEQ ID NO: 44)Entire molecule length: 469 aa 1mnpnqriiti gsvsltiatv cflmqiaila ttvtlhfkqn ecsipannqv tpcepivier 61niteivylnn ttiekeicpe vveyrnwskp qcqitgfapf skdnsirlsa ggdiwitrep 121yvscdpskcy qfalgqgttl dnkhsngtih driphrtllm nelgvpfhlg tkqvciawss 181sschdgkawl hvcvtgddrn atasfiydgm lidsigswsq nilrtqesec vcisgtctvv 241mtdgsasgra dtrilfireg kivhisplsg saqhveecsc yprypnvrcv crdnwkgsnr 301pvidinmady sidssyvcsg lvgdtprndd sssssncrdp nnergnpgvk gwafdngndv 361wmgrtiskds rsgyetfkvi ggwaiansks qtnrqvivdn nnwsgysgif sveskgcinr 421cfyvelirgr pqetrvwwts nsivvfcgts gtygtgswpd ganidfmpi

SUMMARY OF SEQ ID NO DESIGNATIONS Amino Acid or SEQ ID NO HA or NASTRAIN NAME Nucleotide SEQ ID NO: 1 HA (H5) ca A/Vietnam/ Nucleotide1203/04 SEQ ID NO: 2 NA (N1) ca A/Vietnam/ Nucleotide 1203/04SEQ ID NO: 3 HA (H5) ca A/Hong Nucleotide Kong/213/03 SEQ ID NO: 4NA (N1) ca A/Hong Nucleotide Kong/213/03 SEQ ID NO: 5 HA (H5) ca A/HongNucleotide Kong/491/97 SEQ ID NO: 6 NA (N1) ca A/Hong NucleotideKong/486/97 SEQ ID NO: 7 HA (H5) ca A/Hong Nucleotide Kong/491/97(Ser211) SEQ ID NO: 8 NA (N1) ca A/Hong Nucleotide Kong/486/97SEQ ID NO: 9 HA (H9) ca A/ck/Hong Nucleotide Kong/G9/97 SEQ ID NO: 10NA (N2) ca A/ck/Hong Nucleotide Kong/G9/97 SEQ ID NO: 21 HA (H7)A/Netherlands/ Nucleotide 219/03 SEQ ID NOs: 22  NA (N7) A/Netherlands/Nucleotide & 45 219/03 SEQ ID NO: 23 HA (H7) A/ck/BC/CN- Nucleotide 7/04SEQ ID NO: 24 NA (N3) A/ck/BC/CN- Nucleotide 7/04 SEQ ID NO: 25 HA (H7)ca A/ck/BC/CN- Nucleotide 6/04 SEQ ID NO: 26 NA (N3) ca A/ck/BC/CN-Nucleotide 6/04 SEQ ID NO: 33 HA (H6) ca A/Duck Nucleotide SEQ ID NO: 34NA (N9) ca A/Duck Nucleotide SEQ ID NO: 35 HA (H6) ca A/Teal NucleotideSEQ ID NO: 36 NA (N1) ca A/Teal Nucleotide SEQ ID NO: 37 HA (H6)ca A/Mallard Nucleotide SEQ ID NO: 38 NA (N2) ca A/Mallard NucleotideSEQ ID NO: 11 HA (H5) ca A/Vietnam/ Amino Acid 1203/04 SEQ ID NO: 12NA (N1) ca A/Vietnam/ Amino Acid 1203/04 SEQ ID NO: 13 HA (H5) ca A/HongAmino Acid Kong/213/03 SEQ ID NO: 14 NA (N1) ca A/Hong Amino AcidKong/213/03 SEQ ID NO: 15 HA (H5) ca A/Hong Amino Acid Kong/491/97SEQ ID NO: 16 NA (N1) ca A/Hong Amino Acid Kong/486/97 SEQ ID NO: 17HA (H5) ca A/Hong Amino Acid Kong/491/97 (Ser211) SEQ ID NO: 18 NA (N1)ca A/Hong Amino Acid Kong/486/97 SEQ ID NO: 19 HA (H9) ca A/ck/HongAmino Acid Kong/G9/97 SEQ ID NO: 20 NA (N2) ca A/ck/Hong Amino AcidKong/G9/97 SEQ ID NO: 27 HA (H7) A/Netherlands/ Amino Acid 219/03SEQ ID NO: 28 NA (N7) A/Netherlands/ Amino Acid 219/03 SEQ ID NO: 29HA (H7) A/ck/BC/CN- Amino Acid 7/04 SEQ ID NO: 30 NA (N3) A/ck/BC/CN-Amino Acid 7/04 SEQ ID NO: 31 HA (H7) ca A/ck/BC/CN- Amino Acid 6/04SEQ ID NO: 32 NA (N3) ca A/ck/BC/CN- Amino Acid 6/04 SEQ ID NO: 39HA (H6) ca A/Duck Amino Acid SEQ ID NO: 40 NA (N9) ca A/Duck Amino AcidSEQ ID NO: 41 HA (H6) ca A/Teal Amino Acid SEQ ID NO: 42 NA (N1)ca A/Teal Amino Acid SEQ ID NO: 43 HA (H6) ca A/Mallard Amino AcidSEQ ID NO: 44 NA (N2) ca A/Mallard Amino Acid

1. A 6:2 reassortant influenza virus, wherein said virus comprises 6internal genome segments from one or more donor viruses other than A/AnnArbor/6/60 and 2 surface antigen genome segments, wherein the surfaceantigen genome segments encode an HA and a NA polypeptide, wherein thesurface antigen genome segments that encodes the HA polypeptide producesa polypeptide comprising the amino acid sequence of SEQ ID NO:29.
 2. The6:2 reassortant influenza virus of claim 1, wherein said donor viruscomprises one or more of the following phenotypes:temperature-sensitive, cold-adapted, or attenuated.
 3. The 6:2reassortant influenza virus of claim 1, wherein said donor virus is PR8.4. The 6:2 reassortant influenza virus of claim 1, wherein said donorvirus is A/Leningrad/17.
 5. An immunogenic vaccine compositioncomprising an immunologically effective amount of the reassortantinfluenza virus of claim
 1. 6. An immunogenic vaccine compositioncomprising an immunologically effective amount of the reassortantinfluenza virus of claim
 2. 7. An immunogenic vaccine compositioncomprising an immunologically effective amount of the reassortantinfluenza virus of claim
 3. 8. An immunogenic vaccine compositioncomprising an immunologically effective amount of the reassortantinfluenza virus of claim
 4. 9. A method for stimulating the immunesystem of an individual to produce a protective immune response againstinfluenza virus, the method comprising administering to the individualan immunologically effective amount of the reassortant influenza virusof claim 1 in a physiologically effective carrier.
 10. A method forstimulating the immune system of an individual to produce a protectiveimmune response against influenza virus, the method comprisingadministering to the individual an immunologically effective amount ofthe reassortant influenza virus of claim 2 in a physiologicallyeffective carrier.
 11. A method for stimulating the immune system of anindividual to produce a protective immune response against influenzavirus, the method comprising administering to the individual animmunologically effective amount of the reassortant influenza virus ofclaim 3 in a physiologically effective carrier.
 12. A method forstimulating the immune system of an individual to produce a protectiveimmune response against influenza virus, the method comprisingadministering to the individual an immunologically effective amount ofthe reassortant influenza virus of claim 4 in a physiologicallyeffective carrier.
 13. A method of prophylactic or therapeutic treatmentof a viral infection in a subject, the method comprising: administeringto the subject, the virus of claim 1 in an amount effective to producean immunogenic response against the viral infection.
 14. A method ofprophylactic or therapeutic treatment of a viral infection in a subject,the method comprising: administering to the subject, the virus of claim3 in an amount effective to produce an immunogenic response against theviral infection.
 15. A method of prophylactic or therapeutic treatmentof a viral infection in a subject, the method comprising: administeringto the subject, the virus of claim 4 in an amount effective to producean immunogenic response against the viral infection.
 16. The method ofclaim 13, wherein said virus is killed or inactivated.
 17. The method ofclaim 14, wherein said virus is killed or inactivated.
 18. The method ofclaim 15, wherein said virus is killed or inactivated.
 19. The method ofclaim 9, wherein the subject is a human.
 20. The method of claim 10,wherein the subject is a human.
 21. The method of claim 11, wherein thesubject is a human.
 22. A live attenuated influenza vaccine comprisingthe composition of claim
 7. 23. A live attenuated influenza vaccinecomprising the composition of claim
 8. 24. A method for producing aninfluenza virus in cell culture, the method comprising: i) introducinginto a population of host cells, which population of host cells iscapable of supporting replication of influenza virus, a plurality ofvectors comprising nucleotide sequences corresponding to: (a) at least 6internal genome segments of a first influenza strain, wherein the firstinfluenza strain is not A/Ann Arbor/6/60; and, at least one genomesegment encoding an HA surface antigen polypeptide wherein the surfaceantigen polypeptide comprises the amino acid sequence of SEQ ID NO: 29;or (b) at least 6 internal genome segments of a first influenza strain,wherein the first influenza strain is not A/Ann Arbor/6/60 and whichinfluenza strain comprises one or more phenotypic attributes selectedfrom the group consisting of: attenuated, cold adapted and temperaturesensitive; and, at least one genome segment encoding an HA surfaceantigen wherein the surface antigen polypeptide comprises the amino acidsequence of SEQ ID NO: 29, ii) culturing the population of host cells ata temperature less than or equal to 35° C.; and, iii) recovering aninfluenza virus.
 25. An immunogenic composition comprising animmunologically effective amount of the influenza virus produced by themethod of claim
 24. 26. A method for stimulating the immune system of anindividual to produce a protective immune response against influenzavirus, the method comprising administering to the individual animmunologically effective amount of the influenza virus produced by themethod of claim 24 in a physiologically effective carrier.
 27. A methodfor stimulating the immune system of an individual to produce aprotective immune response against influenza virus, the methodcomprising administering to the individual the immunogenic compositionof claim
 25. 28. A live attenuated influenza vaccine comprising theimmunogenic composition of claim
 25. 29. A split virus or killed virusvaccine comprising the immunogenic composition of claim 25.